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Abstract —Mobile micro-clouds are promising for enabling performance-critical cloud applications. However, one challenge therein is 
the dynamics at the network edge. In this paper, we study how to place service instances to cope with these dynamics, where multiple 
users and service instances coexist in the system. Our goal is to find the optimal placement (configuration) of instances to minimize 
the average cost over time, leveraging the ability of predicting future cost parameters with known accuracy. We first propose an offline 
algorithm that solves for the optimal configuration in a specific look-ahead time-window. Then, we propose an online approximation 
algorithm with polynomial time-complexity to find the placement in real-time whenever an instance arrives. We analytically show that 
the online algorithm is O(l)-competitive for a broad family of cost functions. Afterwards, the impact of prediction errors is considered 
and a method for finding the optimal look-ahead window size is proposed, which minimizes an upper bound of the average actual 
cost. The effectiveness of the proposed approach is evaluated by simulations with both synthetic and real-world (San Francisco taxi) 
user-mobility traces. The theoretical methodology used in this paper can potentially be applied to a larger class of dynamic resource 
allocation problems. 
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1 Introduction 

Many emerging applications, such as video stream¬ 
ing, real-time face/object recognition, require high data 
processing capability. However, portable devices (e.g. 
smartphones) are generally limited by their size and 
battery life, which makes fhem incapable of performing 
complex compufafional fasks. A remedy for fhis is fo 
ufilize cloud computing fechniques, where fhe cloud 
performs fhe compufafion for ifs users. In fhe fradifional 
seffing, cloud services are provided by cenfralized dafa- 
cenfers fhaf may be locafed far away from end-users, 
which can be inefficient because users may experience 
long latency and poor connectivity due to long-distance 
communication Q. The newly emerging idea of mobile 
micro-clouds (MMCs) is fo place fhe cloud closer fo end- 
users, so fhaf users can have fasf and reliable access fo 
services. A small-sized server clusfer hosting an MMC 
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Figure 1. Application scenario. 

is direcfly connecfed fo a network componenf af fhe nef- 
work edge. For example, if can be connecfed fo fhe wire¬ 
less basesfafion, as proposed in 0 and Q, providing 
cloud services fo users fhaf are eifner connecfed fo fhe 
basesfafion or are wifhin a reasonable disfance from if. It 
can also be connected to other network entities that are 
in close proximity to users. Fig. shows an application 
scenario where MMCs coexist with a backend cloud. 
MMCs can be used for many applications fhaf require 
high reliabilify or high dafa processing capabilify Q. 
Similar concepfs include cloudlef pi, follow me cloud 
I fog compufing, edge computing M, small cell cloud 
l^, efc. We use fhe ferm MMC in fhis paper. 

One imporfanf issue in MMCs is fo decide which 
MMC should perform the computation for a parficular 
user or a sef of users, faking info accounf user mobilify 
and ofher d 5 mamic changes in fhe network. Providing a 
service to a user (or a set of users) requires sfarfing a 
service insfance, which can be run eifher in fhe backend 
cloud or in one of fhe MMCs, and fhe quesfion is how fo 
choose fhe optimal location fo run fhe service insfance. 
Besides, users may move across differenf geographical 
areas due fo mobilify, fhus anofher quesfion is whefher 
we should migrafe fhe service insfance from one cloud 
(which can be eifher an MMC or fhe backend cloud) 
fo anofher cloud when fhe user locafion or nefwork 
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condition changes. For every cloud, there is a cosj^ 
associated with running the service instance in it, and 
there is also a cost associated with migrating the service 
instance from one cloud to another cloud. The placement 
and migration of service insfances fherefore needs fo 
properly fake info accounf fhis cosf. 


1.1 Related Work 

The abovemenfioned problems are relafed fo appli- 
cafion/workload placemen! problems in cloud environ- 
menfs. Alfhough exisfing work has sfudied such prob¬ 
lems under complex network fopologies 0, fhey 
mainly focused on relafively sfafic network conditions 
and fixed resource demands in a data-center environ¬ 
ment. The presence of d 3 mamically changing resource 
availability that is related to user mobility in an MMC 
environment has not been sufficiently considered. 

When user mobility exists, it is necessary to consider 
real-time (live) migration of service instances. For ex¬ 
ample, it can be beneficial to migrate the instance to a 
location closer to the user. Only a few existing papers in 
the literature have studied this problem |10 |-p^. The 
main approach in |10|-|12| is to formulate the mobility- 
driven service instance migration problem as a Markov 
decision process (MDP). Such a formulation is suitable 
where the user mobility follows or can be approximated 
by a mobility model that can be described by a Markov 
chain. Flowever, there are cases where the Markovian 
assumption is not valid p3) . Besides, p^-p2| either 
do not explicitly or only heuristically consider multiple 
users and service instances, and they assume specific 
structures of the cost function that are related to the loca¬ 
tions of users and service instances. Such cost structures 
may be inapplicable when the load on different MMCs 
are imbalanced or when we consider the backend cloud 
as a placement option. In addition, the existing MDP- 
based approaches mainly consider service instances that 
are constantly rurming in the cloud system; they do not 
consider instances that may arrive to and depart from 
the system over time. 

Systems with online (and usually unpredictable) ar¬ 
rivals and departures have been studied in the field 
of online approximation algorithms |^, p3] . The goal 
is to design efficient algorithms (usually with pol 5 mo- 
mial time-complexity) that have reasonable competitive 
ratio^ However, most existing work focus on prob¬ 
lems that can be formulated as integer linear programs. 
Problems that have convex but non-linear objective func¬ 
tions have attracted attention only very recently 
0, where the focus is on online covering problems in 
which new constraints arrive over time. Our problem is 
different from the existing work in the sense that the 
online arrivals in our problem are abstracted as change 
in constraints (or, with a slightly different but equivalent 


1. The term "cost" in this paper is an abstract notion that can stand 
for monetary cost, service access latency of users, service interruption 
time, amount of transmission/processing resource consumption, etc. 

2. We define the competitive ratio as the maximum ratio of the cost 
from the online approximation algorithm to the true optimal cost from 
offline placement. 


formulation, adding new variables) instead of adding 
new constraints, and we consider the average cost over 
multiple timeslots. Meanwhile, online departures are not 
considered in KID, 0. 

Concurrently with the work presented in this paper, 
we have considered non-realtime applications in p8) , 
where users submit job requests that can be processed 
after some time. Different from [ [18| , we consider users 
continuously connected to services in this paper, which 
is often the case for delay-sensitive applications (such as 
live video streaming). The technical approach in this pa¬ 
per is fundamentally different from that in p^ . Besides, 
a Markovian mobility model is still assumed in p8) . 

Another related problem is the load balancing in dis¬ 
tributed systems, where the goal is to even out the load 
distribution across machines. Migration cost, future cost 
parameter prediction and the impact of prediction error 
are not considered in load balancing problems @, 

|22| . We consider all these aspects in this paper, and 
in addition, we consider a generic cost definition that 
can be defined to favor load balancing as well as other 
aspects. 

We also note that existing online algorithms with 
provable performance guarantees are often of theoretical 
nature p4|-p^, which may not be straightforward to 
apply in practical systems because these algorithms can 
be conceptually complex thus difficult to understand. At 
the same time, most online algorithms applied in prac¬ 
tice are of heuristic nature without theoretically prov¬ 
able optimality guarantees ||^; the performance of such 
algorithms are usually evaluated under a specific exper¬ 
imentation setting (see references of |j^), thus they may 
perform poorly under other settings that possibly occur 
in practice p3| . For example, in the machine scheduling 
problem considered in | |24] , a greedy algorithm (which is 
a common heuristic) that works well in some cases does 
not work well in other cases. We propose a simple and 
practically applicable online algorithm with theoretically 
provable performance guarantees in this paper, and also 
verify its performance with simulation using both S 5 m- 
thetic arrivals and real-world user traces. 

1.2 Main Contributions 

In this paper, we consider a general setting which 
allows heterogeneity in cost values, network structure, 
and mobility models. We assume that the cost is related 
to a finite set of parameters, which can include the 
locations and preferences of users, load in the system, 
database locations, etc. We focus on the case where there 
is an underlying mechanism to predict the future values 
of these parameters, and also assume that the prediction 
mechanism provides the most likely future values and 
an upper bound on possible deviation of the actual value 
from the predicted value. Such an assumption is valid 
for many prediction methods that provide guarantees on 
prediction accuracy. Based on the predicted parameters, 
the (predicted) future costs of each configuration can 
be found, in which each configuration represents one 
particular placement sequence of service instances. 
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With the above assumption, we formulate a problem 
of finding fhe optimal configuration of service insfances 
fhaf minimizes fhe average cosf over fime. We define a 
look-ahead window fo specify fhe amounf of fime that 
we look (predict) into the future. The main contributions 
of fhis paper are summarized as follows: 

1) We firsf focus on fhe offline problem of service in- 
sfance placemenf using predicfed cosfs within a spe¬ 
cific look-ahead window, where fhe insfance arrivals 
and deparfures wifhin this look-ahead window are 
assumed to be known beforehand. We show fhaf this 
problem is equivalent to a shortest-path problem 
in a virtual graph formed by all possible config- 
urafions, and propose an algorifhm (Algorifhm 
in Secfion |3.3) fo find its optimal solution using 
dynamic programming. 

2) We note that it is often practically infeasible fo know 
in advance about when an instance will arrive to 
or depart from fhe sysfem. Meanwhile, Algorithm 
may have exponential time-complexity when there 
exist multiple instances. Therefore, we propose an 
online approximation algorithm thaf finds the place¬ 
ment of a service insfance upon ifs arrival with 
pol 5 momial time-complexity. The proposed online 
algorithm calls Algorithm]^ as a subroutine for each 
instance upon its arrival. We analytically evaluate 
the performance of this online algorithm compared 
to the optimal offline placemenf. The proposed on¬ 
line algorithm is 0(l)-competitiv^for certain types 
of cosf funcfions (including fhose which are linear, 
pol 5 momial, or in some ofher specific form), under 
some mild assumpfions. 

3) Considering the existence of prediction errors, we 
propose a method to find the optimal look-ahead 
window size, such that an upper bound on the actual 
placement cost is minimized. 

4) The effectiveness of fhe proposed approach is eval¬ 
uated by simulations with both S 5 mthetic traces and 
real-world mobility traces of San Francisco faxis. 

The remainder of this paper is organized as follows. 
The problem formulafion is described in Section Sec¬ 
tion 1^ proposes an offline algorifhm to find fhe opfimal 
sequence of service insfance placemenf wifh given look¬ 
ahead window size. The online placemenf algorifhm 
and its performance analysis are presented in Section 
Section ^ proposes a method to find the optimal lool^ 
ahead window size. Section presents the simulation 
results and Section draws conclusions. 

2 Problem Formulation 

We consider a cloud computing system as shown in 
Fig-0 where the clouds are indexed by k S {1,2,..., A}. 
Each cloud k can be either an MMC or a backend cloud. 
All MMCs together with the backend cloud can host 
service instances that may arrive and leave the system 
over time. A service instance is a process that is executed 

3. We say that an online algorithm is c-competitive if its competitive 
ratio is upper bounded by c. 
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Figure 2. Timing of the proposed approach. 

for a particular task of a cloud service. Each service 
insfance may serve one or a group users, where there 
usually exists data transfer between the instance and the 
users it is serving. A time-slotted system as shown in 
Fig.0 is considered, in which the actual physical time 
interval corresponding to each slot t = 1,2,3, ... can be 
either the same or different. 

We consider a window-based control framework, 
where every T slots, a controller performs cosf predicfion 
and computes the service instance configuration for the 
next T slots. We define these T consecutive slots as 
a look-ahead window. Service instance placement within 
each window is found either at the begirming of the 
window (in the offline case) or whenever an insfance 
arrives (in fhe online case). We limif ourselves fo within 
one look-ahead window when finding the configuration. 
In other words, we do not attempt to find the place¬ 
ment in the next window until the time for fhe currenf 
window has elapsed and fhe next window starts. Our 
solution can also be extended to a slot-based control 
framework where the controller computes the next T- 
slot configuration at the begirming of every slot, based 
on predicted cost parameters for the next T slots. We 
leave the detailed comparison of fhese frameworks and 
fheir variafions for fufure work. 

2.1 Definitions 

We introduce some definitions in the following. A 
summary of main notations is given in Appendix 

2.1.1 Service Instances 

We say a service instance arrives to the system if it 
is created, and we say it departs from fhe sysfem if its 
operation is finished. Service instances may arrive and 
depart over time. We keep an index counter to assign an 
index for each new insfance. The counfer is inifialized 
fo zero when the cloud system starts to operat^ Upon 
a service instance arrival, we increment the counter by 
one, so that if fhe previously arrived insfance has index 
i, a newly arrived insfance will have index i -I- 1. With 
this definition, if i < if insfance i arrives no lafer fhan 
insfance i'. A parficular instance i can only arrive to the 
system once, and we assume that arrivals always occur 
at the begirming of a slot and departures always occur 
at the end of a slof. Eor example, consider fimeslofs t = 
1,2,3,4,5, instance i = 2 may arrive to the system at the 
beginning of slof t = 2, and deparf from the system at 
the end of slof f = 4. Af any fimeslof t, insfance i can 

4. This is for ease of presentation. In practice, the index can be reset 
when reaching a maximum counter number, and the definition of ser¬ 
vice configurations (defined later) can be easily modified accordingly. 
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have one of the following states: not arrived, running, or 
departed. For the above example, instance i = 2 has not 
yet arrived to the system in slot t = 1, it is running in 
slots t = 2,3, 4, and it has already departed in slot t = 5. 
Note that an instance can be rurming across multiple 
windows each containing T slots before it departs. 


2.1.2 Service Configurations 


Consider an arbitrary sequence of consecutive times¬ 
lots t G {to, to + 1, to + Q — 1}, where Q is an integer. 
For simplicity, assume that the instance with the smallest 
index running in slot to has index i = 1, and the instance 
with the largest index running in any of the slots in 
{to,..., to-l-Q— 1} has index M. According to the index as¬ 
signment discussed in Section [2. 1.1 there can be at most 
M instances running in any slot t G {tg, ...,to + Q — 1}. 

We define a Q-hy-M matrix denoted by tt, where its 
(g,z)th {q G {1,...,(5}) element (Tr),^ G {Q,l,2, ...,K} 
denotes the location of service instance i in slot tq = 
to + q — 1 stands for "is defined to be equal to"). 

We set {'K)qi according to the state of instance i in slot 
tq, as follows 


{rr)qi — 



if i is not rurming in slot tq 

if i is running in cloud k in slot tq 


where instance i is not rurming if it has not yet arrived 
or has already departed. The matrix tt is called the 
configuration of instances in slots {to,...,to -I- Q — 1}. 
Throughout this paper, we use matrix tt to represent 
configurations in different subsets of timeslots. We write 
Trfo, ti, ...,tn) to explicitly denote the configuration in 
slots {to,ti, ...,tn} (we have Q = tn—to + l), and we write 
TT for short where the considered slots can be inferred 
from the context. For a single slot t, Trf) becomes a 
vector (i.e., Q — 1). 

Remark: The configurations in different slots can ap¬ 
pear either in the same matrix or in different matrices. 
This means, from Trfo, ■.■,to + Q — 1), we can get 7r(t) 
for any t G {to, to + Q — 1}, as well as 7r(t — 1, t) for 
any t G {tg + 1, •••,to + Q — 1}/ etc., and vice versa. For 
the ease of presentation later, we define (7r(0))i = 0 for 
any i. 


2.1.3 Costs 

The cost can stand for different performance-related 
factors in practice, such as monetary cost (expressed as 
the price in some currency), service access latency of 
users (in seconds), service interruption time (in seconds), 
amount of transmission/processing resource consump¬ 
tion (in the number of bits to transfer, CPU cycles, 
memory size, etc.), or a combination of these. As long 
as these aspects can be expressed in some form of a cost 
function, we can treat them in the same optimization 
framework, thus we use the generic notion of cost in 
this paper. 

We consider two t 5 q)es of costs. The local cost U{t, 7r(t)) 
specifies the cost of data transmission (e.g., between each 
pair of user and service instance) and processing in slot t 


when the configuration in slot t is Trf). Its value can de¬ 
pend on many factors, including user location, network 
condition, load of clouds, etc., as discussed in SectionjU^ 
When a service instance is initiated in slot t, the local 
cost in slot t also includes the cost of initial placement 
of the corresponding service instance(s). We then define 
the migration cost W{t, Trf — l),7r(t)), which specifies 
the cost related to migration between slots t — 1 and 
t, which respectively have configurations 7r(t — 1) and 
7r(t). There is no migration cost in the very first timeslot 
(start of the system), thus we define W{1,-,-) = 0. The 
sum of local and migration costs in slot t when following 
configuration Trit — l,t) is given by 

C'^(t-l.t)(t) = U{t,TT{t))+W{t,TT{t- l),Tr(t)) (1) 


The above defined costs are aggregated costs for all service 
instances in the system during slot t. We will give 


concrete examples of these costs later in Section 4.2.1 


2.2 Actual and Predicted Costs 

To distinguish between the actual and predicted cost 
values, for a given configuration tt, we let (t) denote 
the actual value of and let D^flt) denote the pre¬ 
dicted (most likely) value of when cost-parameter 

prediction is performed at the beginning of slot tg- For 
completeness of notations, we define (t) = {t) for 

t < to, because at the beginning of to, the costs of all 
past timeslots are known. For t > to, we assume that the 
absolute difference between (t) and (t) is at most 

e(r) = max |A^(to -Hr) - D{{{to -H r)| 

which represents the maximum error when looking 
ahead for r slots, among all possible configurations tt 
(note that only 7r(to -H r — 1) and 7r(to -f r) are relevant) 
and all possible prediction time instant tg- The function 
e(T) is assumed to be non-decreasing with r, because 
we generally cannot have lower error when we look 
farther ahead into the future. The specific value of e(T) is 
assumed to be provided by the cost prediction module. 

We note that specific methods for predicting future 
cost parameters are beyond the scope of this paper, 
but we anticipate that existing approaches such as p^, 
and p7) can be applied. For example, one sirn^e 
approach is to measure cost parameters on the current 
network condition, and regard them as parameters for 
the future cost until the next measurement is taken. 
The prediction accuracy in this case is related to how 
fast the cost parameters vary, which can be estimated 
from historical records. We regard these cost parameters 
as predictable because they are generally related to the 
overall state of the system or historical pattern of users, 
which are unlikely to vary significantly from its previous 
state or pattern within a short time. This is different 
from arrivals and departures of instances, which can be 
spontaneous and unlikely to follow a predictable pattern. 


2.3 Our Goal 

Our ultimate goal is to find the optimal configuration 
7r*(l, ...,oo) that minimizes the actual average cost over 
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a sufficiently long time, i.e. 




7r*(l, ..., oo) =arg min 

7r(l,...,oo) 


lim 

^max ^ 


Trr, 


( 2 ) 


However, it is impractical to find the optimal solution to 
(|^, because we carmot precisely predict the future costs 
and also do not have exact knowledge on instance arrival 
and departure events in the future. Therefore, we focus 
on obtaining an approximate solution to ij^ by utilizing 
predicted cost values that are collected every T slots. 

Now, the service placement problem includes two 
parts: one is finding the look-ahead window size T, 
discussed in Section the other is finding the config¬ 
uration within each window, where we consider both 
offline and online placements, discussed in Sections 
(offline placement) and (online placement). The offline 
placement assumes that at the begirming of window 
T, we know the exact arrival and departure times of 
each instance within the rest of window T, whereas the 
online placement does not assume this knowledge. We 
note that the notion of "offline" here does not imply 
exact knowledge of future costs. Both offline and online 
placements in Sections!^ and Hare based on the predicted 
costs (t), the actualcost (t) is considered later in 
Section |5] 


3 Offline Service Placement with Given 
Look-Ahead Window Size 

In this section, we focus on the offline placement 
problem, where the arrival and departure times of future 
instances are assumed to be exactly known. We denote 
the configuration found for this problem by TToff. 

3.1 Procedure 

We start with illustrating the high-level procedure of 
finding TToff. When the look-ahead window size T is 
given, the configuration TTgff is found sequentially for 
each window (containing timeslots to,..., to + T — 1), by 
solving the following optimization problem: 

to-l-T-l 

7roff(to,-,to+T-l) = arg min V 

t — to 

(3) 

where D^{t) can be found based on the parameters 
obtained from the cost prediction module. The procedure 
is shown in Algorithm 

In Algorithm every time when solving (j^, we get 
the value of TToff for additional T slots. This is sufficient 
in practice (compared to an alternative approach that 
directly solves for TToff for all slots) because we only 
need to know where to place the instances in the current 
slot. The value of ^^{t) in 0 depends on the 

configuration in slot to ~ 1/ i-O- ’’’(to — l)/ according to Q. 
When to = 1/ ^(fo ~ 1) can be regarded as an arbitrary 
value, because the migration cost W(t, •, •) = 0 for t = 1. 

Intuitively, at the begirming of slot to, l|^ finds the 
optimal configuration that minimizes the predicted cost 
over the next T slots, given the locations of instances in 
slot to — 1. We focus on solving |j^ next. 


Algorithm 1 Procedure of offline service placement 

1: 

Initialize to = 1 


2: 

loop 


3: 

At the beginning of slot to, 

find the solution to 

4: 

Apply placements 7roff(to,. 

fo -l-T — 1) in timesmts to, •••, io -1- 


T-1 


5: 

tQ i— tQ T 


6: 

end loop 



Instance 

i—. 

in 


Figure 3. Shortest-path formulation with K = 2, M = 2, 
and T = 3. Instance t = 1 is running in all slots, instance 
i = 2 arrives at the beginning of slot to -l-1 and is running 
in slots to -I-1 and to + 2. 



3.2 Equivalence to Shortest-Path Problem 

The problem in ||^ is equivalent to a shortest-path 
problem with (t) as weights, as shown in Fig. 

Each edge represents one possible combination of con¬ 
figurations in adjacent timeslots, and the weight on each 
edge is the predicted cost for such configurations. The 
configuration in slot to — 1 is always given, and the num¬ 
ber of possible configurations in subsequent tim eslots 
is at most K^, where M is defined as in Section 2.1.2 


for the current window {to, ...,to -I- T — 1}, and we note 
that depending on whether the instance is running in 
the system or not, the number of possible configurations 
in a slot is either K or one (for configuration 0). Node 
B is a dummy node to ensure that we find a single 
shortest path, and the edges connecting node B have zero 
weights. It is obvious that the optimal solution to ® can 
be found by taking the shortest (minimum-wei^ted) 
path from node tt( to — 1) to node B in Fig. the nodes 
that the shortest path traverses correspond to the optimal 
solution Troff(to) + T — 1) for ||^. 


3.3 Algorithm 

We can solve the abovementioned shortest-path prob¬ 
lem by means of d 5 mamic programming p8) . The algo¬ 
rithm is shown in Algorithm H where we use Up(t, m) 
and Wp(t, n, m) to respectively denote the predicted lo¬ 
cal and migration costs when Tr(t) = m and Tr(t — 1) = n. 

In the algorithm. Lines [5}|T^ iteratively find the short¬ 
est path (minimum objective function) for each timeslot. 
The iteration starts from the second level of the virtual 
graph in Fig. H which contains nodes with Tr(to). It iter¬ 
ates through ml the subsequent levels that respectively 
contain nodes with 7r(to -I- 1), 7r(<o + 2), etc., excluding 
the last level with node B. In each iteration, the optimal 
solution for every possible (single-slot) configuration 
m is found by solving the Bellman's equation of the 
problem (Line [TT). Essentially, the Bellman's equation 
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Algorithm 2 Algorithm for solving ||^ 

1: Define variables m and n to represent configurations respectively 
in the current and previous iteration (level of graph) 

2: Define vectors tth, and ^rn for all m, n, where TTm (correspond¬ 
ingly ^rn) records the optimal configuration given that the con¬ 
figuration at the current (correspondingly previous) timeslot of 
iteration is m 

3: Define variables /rm and for all m to record the sum cost 
values from slot to respectively to the current and previous slot 
of iteration, given that the configuration is m in the current or 
previous slot 

4: Initialize /rm <— 0 and TTm <— 0 for all m 
5: for t = to,to + T — 1 do 
6: for all m do 

7: Um < /Tm 

8: 4m TTm 

9: end for 

10: for all m do 

11: n* <— argmiun {i/n + Up{t, m) + Wp{t, n, m)} 

12: 7ri„(fo, ...,t - 1) ^ 4n* ho, f - 1) 

13: 7rm{f) <— ni 

14: <—I'n* + Up(t,m) + Wp(t,ii* ,m) 

15: end for 

16: end for 

17: m* <— argminm 

18: 7rp££(io,to -b T — 1) <— {to ,to -b T — 1) 

19: return nofi{to, ...,to + T - 1) 

finds fhe shorfesf pafh between fhe fop node it {to — 1 ) 
and fhe currenf node under consideration (e.g., node 
7r(fo + l) = (1,1) in Fig. B , by considering all fhe nodes in 
fhe previous level (nodes 7r(fo) = (1,0) and 7r(fo) = (2,0) 
in Fig. 1^. The sum weigh! on fhe shorfesf pafh befween 
each node in fhe previous level and fhe fop node 7r(to — 1) 
is sfored in and fhe corresponding nodes fhaf fhis 
shorfesf pafh fraverses fhrough is sfored in Based 
on tZm and Line finds fhe shorfesf pafhs for all 
nodes in fhe currenf level, which can again be used for 
finding fhe shorfesf pafhs for all nodes in fhe nexf level 
(in fhe nexf iferafion). 

Affer iferafing fhrough all levels, fhe algorifhm has 
found fhe shorfesf pafhs befween fop node 7r(fo — 1) 
and all nodes in fhe lasf level wifh 7r(fo — T — 1). Now, 
Lines Hzl and [l^ find fhe minimum of all fhese shorfesf 
pafhs, giving fhe optimal configuration. If is obvious fhaf 
oufpuf of fhis algorifhm satisfies fhe Bellman's principle 
of opfimalify, so fhe resulf is fhe shorfesf pafh and hence 
fhe opfimal solufion fo (|^. 

Complexity: When fhe vecfors TTm and are 

sfored as linked-lisfs, Algorifhm has fime-complexify 
O because fhe minimizafion in Line [TT| requires 

enumerafing af mosf possible configurations, and 
fhere can be af mosf K^T possible combinations of 
values of t and m. 

4 Complexity Reduction and Online 
Service Placement 

The complexify of Algorifhm 0 is exponenfial in fhe 
number of insfances M, so if is desirable fo reduce fhe 
complexify. In fhis secfion, we propose a mefhod fhaf 
can find an approximafe solufion fo l|^ and, af fhe same 
fime, handle online insfance arrivals and deparfures fhaf 
are nof known beforehand. We will also show fhaf 
is NP-hard when M is non-consfanf, which jusfifies fhe 
need fo solve l|^ approximafely in an efficienf manner. 


4.1 Procedure 

In fhe online case, we modify fhe procedure given 
in Algorifhm so fhaf insfances are placed one-by-one, 
where each placemen! greedily minimizes fhe objective 
function given in (|^, while fhe configurations of previ¬ 
ously placed insfances remain unchanged. 

We assume fhaf each service insfance i has a max¬ 
imum lifetime Tlife(t), denofing fhe maximum number 
of remaining fimeslofs (including fhe currenf slof) fhaf 
fhe insfance remains in fhe sysfem. The value of Tlife(t) 
may be infinify for insfances fhaf can pofenfially sfay 
in fhe sysfem for an arbifrary amounf of fime. The 
acfual fime fhaf fhe insfance sfays in fhe sysfem may be 
shorfer fhan Tiife(t), buf if cannof be longer fhan Tiife(t). 
When an insfance leaves fhe sysfem before ifs maximum 
lifefime has elapsed, we say fhaf such a service insfance 
deparfure is unpredictable. 

We use TTon to denote the configuration tt computed 
by online placement. The configuration ttoh is updated 
every time when an instance arrives or unpredictably 
departs. At the beginning of the window (before any 
instance has arrived), it is initiated as an all-zero matrix. 

For a specific look-ahead window {fo, + T — 1}, 
when service instance i arrives in slot t G {fo,...,fo + 
T — 1}, we assume that this instance stays in the sys¬ 
tem until slot tf. = min {t + Tlife(z) — l;to + T — 1}, and 
accordingly update the configuration by 

te 

7ron(f, =arg min (4) 

s.t. 7r(f, ...,fe) = 7ron(L except for column i 

Note that only the configuration of instance i (which 
is assumed to be stored in the zth column of tt) is 
found and updated in Q, the configurations of all other 
instances i' ^ i remain unchanged. The solution to Q 
can still be found with Algorithm The only difference 
is that vectors m and n now become scalar values within 
because we only consider the configuration 
of a single instance i. The complexity in this case be¬ 
comes 0{K^T). At the beginning of the window, all the 
instances that have not departed after slot fo ~ 1 are 
seen as arrivals in slot fg, because we independently 
consider the placements in each window of size T. When 
multiple instances arrive simultaneously, an arbitrary 
arrival sequence is assigned to them; the instances are 
still placed one-by-one by greedily minimizing (|^. 

When an instance i unpredictably departs at the end 
of slot f G (fo,..., fo -I- T — 1}, we update TTon such that 
the ith column of 7ron(f -I-1, fo -I- T’ — 1) is set to zero. 

The online procedure described above is shown in 
Algorithm!^ Recall that 7ron(f, fe) and 7ron(f-l-l, fo + 
T — 1) are Doth part of a larger configuration matrix 
TTon (to, + T — 1) (see Section |ZL2) . 

Complexity: When placing a total of M instances, for 
a specific look-ahead window with size T, we can find 
the configurations of these M instances with complexity 
0{K'^TM), because (|^ is solved for M times, each with 
complexity 0{K‘^T). 
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Algorithm 3 Procedure of online service placement 

1: Initialize to = 1 
2: loop 

3: Initialize ■Kon{to, ■••Po + T — 1) as an all-zero matrix 

4: for each timeslot t = to, + T — 1 do 

5: for each instance i arriving at the beginning of slot t do 

6: tf, <— min {t + 7iife(*) - 1; fo + r - 1} 

7: Update 7ron(f, •••, fe) vcith the result from 

8: Apply configurations specified in the itn column of 

7ron(f, fe) for service instance i in timeslots i,te until 
instance i departs 
9: end for 

10: for each instance i departing at the end of slot t do 

11: Set the ith column of 7ron(f -I- 1, fo + ^ — 1) to zero 

12: end for 

13: end for 

14: to <— to T 

15: end loop 


Remark: It is important to note that in the above 
procedure, the configuration ttoh (and thus the cost value 
for f € {to,-,to + T’ - 1}) may vary 
upon instance arrival or departure. It follows that the T- 
slot sum cost t)(f) may vary whenever 

an instance arrives or departs at an arbitrary slot t G 
{to,to+T-1}, and the value of 

stands for the predicted sum cost (over the current win¬ 
dow containing T slots) under the current configuration, 
assuming that no new instance arrives and no instance 
unpredictably departs in the future. This variation in 
configuration and cost upon instance arrival/departure 
is frequently mentioned in the analysis presented next. 

4.2 Performance Analysis 

It is clear that for a single look-ahead window. Al¬ 
gorithm 1^ has pol 5 momial time-complexity while Algo¬ 
rithm]^ has exponential time-complexity. In this subsec¬ 
tion, we show the NP-hardness of the offline service 
placement problem, and discuss the optimality gap be¬ 
tween the online algorithm and the optimal offline place¬ 
ment. Note that we only focus on a single look-ahead 
window in this subsection. The interplay of multiple 
look-ahead windows and the impact of the window size 
will be considered in Section |5] 

4.2.1 Definitions 

For simplicity, we analyze the performance for a 
slightly restricted (but still general) class of cost func¬ 
tions. We introduce some additional definitions next (see 
Appendix for a summary of notations). 

Indexing of Instances: Here, we assume that the 
instance with lowest index in the current window 
{to, ...,to + T — 1} has index i = 1, and the last instance 
that arrives before the current time of interest has index 
i = M, where the current time of interest can be any 
time within the current window. With this definition, 
M does not need to be the largest index in window 
{to,..., to -I- T — 1}. Instead, it can be the index of any 
instance that arrives within {to, ...,to + T—l}. The cost of 
placing up to (and including) instance M is considered, 
where some instances i < M may have already departed 
from the system. 


Possible Configuration Sequence: When considering 
a window of T slots, we define the set of all possible con¬ 
figurations of a single instance as a set of T-dimensional 
vectors A = {(Ai,...,At) : A„ G {0,1,..., A},Vn G 
{1,..., T}, where A„ is non-zero for at most one block 
of consecutive values of n}. We also define a vector 
A G A to represent one possible configuration sequence of a 
single service instance across these T consecutive slots. 
For any instance i, the tth column of configuration matrix 
Tzfo, ■■■,to + T — 1) is equal to one particular value of A. 

We also define a binary variable Xi\, where Xi\ = 1 if 
instance i is placed according to configuration sequence 
A across slots {to, + T — 1} (i.e., the tth column of 
Tr{to,--.,to + T—l) is equal to A), and Xix = 0 otherwise. 
We always have ExeA ^ * G {1, M}- 

We note that the values of Xix may vary over time due 
to arrivals and unpredictable departures of instances, 
which can be seen from Algorithm and by noting the 
relationship between A and tt. Before instance i arrives, 
XiXo = 1 for Aq = [0, ...,0] which contains all zeros, and 
Xix = 0 for A ^ Ag. Upon arrival of instance i, we have 
XiXg = 0 and Xix^ = 1 for a particular Ai. When instance 
i unpredictably departs at slot t', its configuration se¬ 
quence switches from Ai to an alternative (but partly 
correlated) sequence A{ (i.e., (A{)t = (Ai)( for t < t' and 
(A{)t = 0 for t > t', where (A)^ denotes the tth element 
of A), according to Line 11 in Algorithm]^ after which 
XiXg = 0 and Xix[ = 1 . 

Resource Consumption: We assume that the costs 
are related to the resource consumption, and for the 
ease of presentation, we consider two t 5 q)es of resource 
consumptions. The first type is associated with serving 
user requests, i.e., data transmission and processing 
when a cloud is running a service instance, which we 
refer to as the local resource consumption. The second t 5 q)e 
is associated with migration, i.e., migrating an instance 
from one cloud to another cloud, which we refer to as 
the migration resource consumption. 

If we know that instance i operates under configura¬ 
tion sequence A, then we know whether instance i is 
placed on cloud k in slot t, for any k G {\,...,K{ and 
t G {to,---,to + 2^~l}- kVe also know whether instance 
i is migrated from cloud k to cloud I {I G {1, 2,..., Kj) 
between slots t — 1 and t. We use Oixk (t) > 0 to denote 
the local resource consumption at cloud k in slot t when 
instance i is operating under A, where aixk{t) = 0 if 
(A)t 7 ^ k. We use bixki{t) > 0 to denote the migration 
resource consumption when instance i operating under 
A is assigned to cloud k in slot t — 1 and to cloud I 
in slot t, where bixkiit) = 0 if {X)t-i f k or (A)t I, 
and we note that the configuration in slot to ~ 1 (before 
the start of the current window) is assumed to be given 
and thus independent of A. The values of aixk{t) and 
bixki{t) are either service-specific parameters that are 
known beforehand, or they can be found as part of the 
cost prediction. 

We denote the sum local resource consumption at 
cloud k by ykit) = LagA and denote 

the sum migration resource consumption from cloud k 








to cloud I by Zki{t) = Y.iiiY.\eA^i>^ki{t)xix. We may 
omit the argument t in the following discussion. 

Remark: The local and migration resource consump¬ 
tions defined above can be relafed fo CPU and com¬ 
munication bandwidfh occupafion, efc., or fhe sum of 
fhem. We only consider fhese two fypes of resource 
consumpfion for fhe ease of presenfafion. By applying 
fhe same fheorefical framework, fhe performance gap 
resulfs (presenfed lafer) can be exfended fo incorporafe 
mulfiple f 5 rpes of resources and more sophisficafed cosf 
functions, and similar resulfs yield for fhe general case. 

Costs: We refine fhe cosfs defined in Section 12.1.31 
by considering fhe cosf for each cloud or each pair 
of clouds. The local cosf af cloud k in fimeslof t is 
denofed by Uk,t iVkit))- When an insfance is initialed 
in slof t, fhe local cosf in slof t also includes fhe cosf 
of initial placemen! of fhe corresponding insfance. The 
migrafion cosf from cloud k fo cloud I between slots t—1 
and t is denoted by Wki,t {yk{t - 1), yi{t), Zki{t)). Besides 
Zki{t), the migration cost is also related to yk{t — 1) and 
yi{t), because additional processing may be needed for 
migration, and the cost for such processing can be related 
to the current load at clouds k and 1. The functions 
Uk,t {y) and Wki,t (yk, yi,Zki) can be different for different 
slots t and different clouds k and I, and they can depend 
on many factors, such as network condition, background 
load of the cloud, etc. Noting that any constant term 
added to the cost function does not affect the optimal 
configuration, we set Mfc,t(0) = 0 and 0, 0) = 0. 

We also set Wki^ti,-, •, 0) = 0, because there is no migration 
cost if we do not migrate. There is also no migration cost 
at the start of the first timeslot, thus we set Wki,t{-, •, •) = 0 
for t = 1. With these definitions, the aggregated costs 
U{t,TT(t)) and W(t,7r{t — l),7r(t)) can be explicitly ex¬ 
pressed as 

K 

U{t,n{t)) = '^Uk,t{yk{t)) (5) 

k^l 
K K 

W{t, Tr{t-l),n{t))=Y^'^Wki,t{yk{t-l),yi{t),Zki{t)) (6) 

k=ll = l 

We then assume that the following assumption is 
satisfied for the cost functions, which holds for a large 
class of practical cost functions, such as those related to 
the delay performance or load balancing ||^. 

Assumption 1. Both Uk,t{y) and Wki,t{yk,yh zu) are con¬ 
vex non-decreasing/wnctfons ofy (or yk,yh zu), satisfying: 

. ^(0)>0 

• ^{■,;0)>0fort>2 

for all t, k, and I (unless stated otherwise), where de¬ 

notes the derivative of Uk,t with respect to (w.r.t.) y evaluated 
at y = 0, and (tjO) denotes the partial derivative of 

Wki^t w.r.t. Zki evaluated at Zki = 0 and arbitrary yk and yi. 

Vector Notation: To simplify the presentation, we use 
vectors to denote a collection of variables across multiple 
clouds, slots, or configuration sequences. For simplicity, 
we index each element in the vector with multiple 


indexes that are related to the index of the element, and 
use the general notion (or to denote 

the (/ii,/i 2 )th (or (hi,/ 12 ,/i 3 )th) element in an arbitrary 
vector g. Because we know the range of each index, 
multiple indexes can be easily mapped to a single index. 
We regard each vector as a single-indexed vector for the 
purpose of vector concatenation (i.e., joining two vectors 
into one vector) and gradient computation later. 

We define vectors y (with KT elements), z (with 
K^T elements), x (with elements), a^x (with KT 

elements), and b^x (with K^T elements), for every value 
of t G {1, 2,..., M} and A G A. Different values of i and A 
correspond to different vectors a^x and b^x- The elements 
in these vectors are defined as follows: 

(y)fet - 2/fcW- (z)fcii - zki{t), (yi),x = x^x, 


{^ix)kt ~ CliXk{t): {^ix)klt ~ biXkl{t) 

As discussed earlier in this section, Xix may unpre- 
dictably change over time due to arrivals and departures 
of service instances. It follows that the vectors x, y, and 
z may vary over time (recall that y and z are dependent 
on X by definition). The vectors a^x and b^x are constant. 

Alternative Cost Expression: Using the above defini¬ 
tions, we can write the sum cost of all T slots as follows 


D{-x) = D(y,z) = 


A 


K 


'^uk,t {yk{t)) 


k^l 


K K 

+EE Wki,t {yk{t- l),yi{t),Zki{t)) 


(7) 


where the cost function D{-) can be expressed either in 
terms of x or in terms of (y, The cost function defined 
in (7l is equivalent to readers are 

also referred to the per-slot cost defined in Q for com¬ 
parison. The value of D (x) or, equivalently, D (y, z) may 
vary over time due to service arrivals and unpredictable 
service instance departures as discussed above. 


4.2.2 Equivalent Problem Formulation 

With the above definitions, the offline service place¬ 
ment problem in ||^ can be equivalently formulated as 
the following, where our goal is to find the optimal con¬ 
figuration for all service instances 1,2,..., M (we consider 
the offline case here where we know when each instance 
arrives and no instance will unpredictably leave after 
they have been placed): 

min D (x) (8) 

X 

s.t. Xix = l,vt G {1,2, ...,M} 

xeAi 

Xix G {0, l},Vi G {1,2,...,M},A G Ai 

where C A is a subset of feasible configuration 
sequences for instance i, i.e., sequences that contain those 
vectors whose elements are non-zero starting from the 
slot at which i arrives and ending at the slot at which i 
departs, while all other elements of the vectors are zero. 

We now show that l|^, and thus l|^, is NP-hard even 
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in the offline case, which further justifies fhe need for an 
approximafion algorifhm for solving fhe problem. 


Proposition 1. (NP-Hardness) The problem in ([^ in the 
offline sense, and thus ([^, is NP-hard. 

Proof. Problem (|^ can be reduced from fhe parfifion 
problem, which is known fo be NP-complefe 129 Corol¬ 
lary 15.28]. See Appendix for defails. □ 

An online version of problem ||^ can be consfrucfed 
by updafing over time. When an arbitrary instance i 
has not yet arrived, we define as fhe sef confaining an 
all-zero vector. Affer insfance i arrives, we assume fhaf 
it will run in the system until te (defined in Section |4.1| , 
and updafe A^ fo conform fo fhe arrival and deparfure 
times of insfance i (see above). Affer insfance i deparfs, 
Ai can be furfher updafed so fhaf fhe configurations 
corresponding fo all remaining slofs are zero. 


4.2.3 Performance Gap 

As discussed earlier, Algorifhm [^solves l|^ in a greedy 
manner, where each service insfance i is placed to greed¬ 
ily minimize in ||^. In the following, we compare fhe 
resulf from Algorifhm 1^ wifh fhe frue optimal resulf, 
where fhe optimal result assumes offline placement. We 
use X and (y,z) to denote the result from Algorithm!^ 
and use x* and (y*,z*) to denote the offline optimm 
result to Q. 

Lemma 1. (Convexity of D{-)) When Assumption is 
satisfied, the cost function D (x) or, equivalently, D (y,z) is 
a non-decreasing convex function w.r.t. x, and it is also a 
non-decreasing convex function w.r.t. y and z. 

Proof. According to Assumption a Uk,t{yk(t)) and 
Wki,t{yk{t - ^),yi{t),Zki{t)) are non-decreasing convex 
functions. Because yk{t) and Zki{t) are linear mappings 
of Xix with non-negative weights for any t, k, and I, and 
also because the sum of non-decreasing convex functions 
is still a non-decreasing convex function, the lemma 

holds iso] Section 3.2]. □ 

In the following, we use Vx to denote the gradient 
w.r.t. each element in vector x, i.e., the (f, A)th element of 
Vxf9(x) is Similarly, we use Vy,z to denote the 

gradient w.r.t. each element in vector (y, z), where (y, z) 
is a vector that concatenates vectors y and z. 

Proposition 2. (Performance Gap) When Assumption 
is satisfied, we have 

5(x) < DifxPrC) (9) 

or, equivalently, 

5(y,z) < 5(0V'y*,(^^/'z*) (10) 


for any i and A G Ai, in which y,„ax and z^ax respectively 
denote the maximum values of y and z (the maximum is 
taken element-wise) after any number of instance arrivals 
within slots {to,...,to -I- T — 1} until the current time of 
interest (at which time the latest arrived instance has index 
M), {aix, bix) is a vector that concatenates a^x and b^x, and 
denotes the dot-product. 


Proof. See Appendix [C| □ 

Remark: We note that according to the definition of M 
in Section 4.2.1 the bound given in Proposition |2] holds 
at any tiirie of interest within slots {fo, + T ~ ^}r 
i.e., for any number of instances that has arrived to the 
system, where some of them may have already departed. 


4.2.4 Intuitive Explanation to the Constants f and 

The constants f and ip in Proposition are related 
to "how convex" the cost function is. In other words, 
they are related to how fast the cost of placing a single 
instance changes under different amount of existing 
resource consumption. Figure 0 shows an illustrative 
example, where we only consider one cloud and one 
timeslot (i.e., f = 1, T = 1, and K = 1). In this case, 
setting (p = ^(ymax -i amax)/^(y) satisfies where 
flmax denotes the maximum resource consurn^ion of a 
single instance. Similarly, setting ip = ^(y) ■ y/D{y) 
satisfies We can see that the values of (p and ip need 
to be larger when the cost function is more convex. For 
the general case, there is a weighted sum in both the 
numerator and denominator in 0 and (T^ . However, 
when we took at a single cloud (for the local cost) or a 
single pair of clouds (for the migration cost) in a single 
timeslot, the above intuition still applies. 

So, why is the optimality gap larger when the cost 
functions are more convex, i.e., have a larger second 
order derivative? We note that in the greedy assignment 
procedure in Algorithm we choose the configuration 
of each instance i by minimizing the cost under the 
system state at the time when instance i arrives, where 
the system state represents the local and migration re¬ 
source consumptions as specified by vectors y and z. 
When cost functions are more convex, for an alternative 
system state (y',z'), it is more likely that the placement 
of instance i (which was determined at system state 
(y, z)) becomes far from optimum. This is because if cost 
functions are more convex, the cost increase of placing 
a new instance i (assuming the same configuration for 
i) varies more when (y, z) changes. This intuition is 
confirmed by formal results described next. 


4.2.5 Linear Cost Functions 

Consider linear cost functions in the form of 


where cp and ip are constants satisfying 

, ^ Rf y.’z.C (ymax P U^x, Z/n^x “t” hjx) ’ (toA, h^x) 
(p > -^- 

Vy,zH(y,z) • (aix,bix) 

^ VxH (x) ■ X ^ Vy,z-P (y,z) ■ (y,z;) 
5(x) D{y,z) 


( 11 ) 


( 12 ) 


Uk.tiy) = Tk,ty (13) 

Wkl.t {yk, yuzki) = n.^kltyk + (14) 

where the constants yk,t, ri^kit > — 0- 


Proposition 3. When the cost functions are defined as in i 13 • 
and ([l4]), Algorithm^provides the optimal solution. 
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Figure 4. Illustration of the performance gap for t = 1, T = 
1, and K = 1, where a^ax denotes the maximum resource 
consumption of a single instance. In this example, O 


becomes A > 


4*Aei 


and '12 becomes ip > 


^dei 


Proof. We have 


^y,z-^ (ymax "^max hi_\) — Vy,z-^ (y; z) 


Vy,z£' (y, z) • (y, z) = D (y, z) 

because the gradient in this case is a constant. Hence, 
choosing cp = ip = 1 satisfies © and yielding 
H(x) < D{x*) which means that the solution from 
Algorithm is not worse than the optimal solution. □ 

This implies that the greedy service placement is opti¬ 
mal for linear cosf funcfions, which is intuifive because 
fhe previous placemenfs have no impacf on fhe cosf of 
lafer placemenfs when the cost function is linear. 

4.2.6 Polynomial Cost Functions 

Consider pol 5 momial cost functions in the form of 

'u-kAy) = (15) 

P 

Wki.t iyk,yi,zki) = ( 16 ) 

Pl P2 P3 

where p, pi,p 2 ,P 3 are infegers safisfying p > 1, pi -F P 2 + 
P 3 > 1 and fhe consfanfs > 0, > 0. 

We firsf infroduce fhe following assumpfion which 
can be safisfied in mosf pracfical sysfems with an upper 
bound on resource consumptions and departure rates. 

Assumption 2. The following is satisfied: 

• For all i, A, k, I, t, there exists a constants a^ax (md b,„ax, 

such that Q/iXkPf) lPi ct^ax and ^ b^ax 

• The number of instances that unpredictably leave the 
system in each slot is upper bounded by a constant B^. 

Proposition 4. Assume that the cost functions are defined 
as in ( [I^ and ( [I^ while satisfying Assumption [I] and that 
Assumption is satisfied. 

Let n denote the maximum value of p such that 44 > 0 

or > 0, subject to P 1 +P 2 +P 3 = P- The value of Q 

represents the highest order of the polynomial cost functions. 

Define T{I{M)) = 5(xx(m))/5(xJ(j^^)), where X{M) 

is a problem inpu^containing M instances, and xx(m) mirf 
^i(M) respectively the online and offline (optimal) results 
for input I{M). We say that Algorithm is c-competitive 
in placing M instances ifT = maxx(M) r(I(M)) < cfor a 
given M. We have: 

5. A particular problem input specifies the time each instance ar¬ 
rives/departs as well as the values of and for each i, \. 


• Algorithm^is 0(1)-competitive. 

• In particular, for any 5 > 0, there exists a sufficiently 
large M, such that Algorithm^is (fl^ -F 6)-competitive. 

Proof. See Appendix [D| □ 

Proposition Instates that the competitive ratio does not 
indefinitely increase with increasing number of insfances 
(specified by M). Insfead, if approaches a consfanf value 
when M becomes large. 

When fhe cosf funcfions are linear as in (13) a nd (14) , 
we have D = 1. In fhis case, ProposifionE] gives a 
compefifive ratio upper bound of 1 -F 5 (for sufficienfly 
large M) where <5 > 0 can be arbifrarily small, while 
Proposition shows fhaf Algorifhm is opfimal. This 
means fhaf me compefifive ratio upper bound given in 
Proposifion 1^ is asymptotically tight as M goes fo infinify 

4.2.7 Linear Cost at Backend Cloud 

Algorifhm is also 0(l)-compefifive for some more 
general forms of cosf funcfions. For example, consider 
a simple case where fhere is no migration resource 
consumption, i.e. bixkft) — 0 for all Define 

Uko,t{y) = 72 / for some cloud fcp and all t, where 7 > 0 is 
a consfanf. For all ofher clouds k f kg, define Uk,tiy) as 
a general cosf function while safisfying Assumpfion 
and some additional mild assumptions presenfed be¬ 
low. Assume fhaf fhere exisfs a consfant Umax such fhaf 
aiXkf) < Umax for alH, A, fc, t. 

Because Uk,t{y) is convex non-decreasing and Algo¬ 
rithm 1^ operates in a greedy manner, if {y) > 7/ 
no new insfance will be placed on cloud k, as if incurs 
higher cosf fhan placing if on kg. As a resulf, fhe maxi¬ 
mum value of yk (t) is bounded, lef us denofe fhis upper 
bound by y^^{t). We nofe fhaf y™^^(t) is only dependent 
on the cost function definition and is independenf of fhe 
number of arrived insfances. 

Assume Mfc.t(2/™’‘(f)) < 00 and ^^{yk^'^if) + a^ax) < 
00 for all k f kfj and t. When ignoring fhe cosf af cloud 
fco/ the ratio T{I{M)) does not indefinitely grow with 
incoming instances, because among all t/fc(f) G [ 0 , 2 /““(t)] 
for all t and k f kg, we can find f and f fhat safisfy 
(T^ and (H}. we can also find fhe compefifive rafio 
maxx(M) r(X(M)). The resulting F is only dependenf 
on fhe cosf function definition, hence if does nof keep 
increasing wifh M. Taking info accounf fhe cosf at cloud 
ko, the above result still applies, because the cost at kg 
is linear in ykg (t), so that in either of (TT) , (T^ , or in fhe 
expression of r(I(M)), fhe exisfence of fhis linear cosf 
only adds a same quantify (which mighf be differenf in 
differenf expressions fhough) fo bofh fhe numerafor and 
denominafor, which does nof increase F (because P > 1). 

The cloud kg can be considered as fhe backend cloud, 
which usually has abundant resources thus its cost-per- 
unit-resource often remains unchanged. This example 
can be generalized to cases with non-zero migration 
resource consumption, and we will illustrate such an 
application in the simulations in Section 
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5 Optimal Look-Ahead Window Size 

In this section, we study how to find the optimal 
window size T to look-ahead. When there are no errors 
in the cost prediction, setting T as large as possible 
can potentially bring the best long-term performance. 
However, fhe problem becomes more complicafed when 
we consider fhe predicfion error, because fhe farfher 
ahead we look info fhe fufure, fhe less accurafe fhe 
predicfion becomes. When T is large, fhe predicfed cosf 
value may be far away from fhe acfual cosf, which can 
cause fhe configuration obtained from predicfed cosfs 
wifh size-T windows (denofed by TTp) deviafe 
significanfly from fhe hue opfimal configuration tt* ob¬ 
tained from acfual cosfs AT^{t). Nofe fhaf tt* is obfained 
from acfual cosfs which is differenf from x* and 

(y*,z*) which are obfained from predicfed cosfs D^^(t) 
as defined in Secfion |4.2[ Also nofe fhaf and tt* 
specify fhe configurafions for an arbitrarily large number 
of fimeslofs, as in ||^. Conversely, when T is small, fhe 
solufion may nof perform well in fhe long-ferm, because 
fhe look-ahead window is small and the long-term effect 
of migrafion is nof considered. We have fo find fhe 
opfimal value of T which minimizes bofh fhe impacf of 
predicfion error and fhe impacf of fruncafing fhe look¬ 
ahead fime-span. 

We assume fhaf fhere exisfs a consfanf a safisfying 

max Wa(f, 7r(f — 1), 7r(f)) < cr (17) 

7r(t-l,t) 

for any t, fo represenf fhe maximum value of fhe acfual 
migrafion cosf in any slof, where Wa(f,7r(f — l),7r(f)) 
denofes fhe acfual migrafion cosf. The value of a is 
sysfem-specific and is relafed fo fhe cosf definifion. 

To help wifh our analysis below, we define fhe sum- 
error sfarfing from slof fo up fo slof -\-T — 1 as 

tQ-\-T 

F{T)A Y, e{t-h) (18) 

t=to 

Because e(f —fo) > 0 and e(f — fo) is non-decreasing wifh 
f, if is obvious fhaf J^(r-^2)-J^(r-Hl) > F{T+1)-F(T). 
Hence, F(T) is a convex non-decreasing funcfion for T > 
0, where we define F{0) = 0. 


5.1 Upper Bound on Cost Difference 

In fhe following, we focus on fhe objecfive funcfion 
given in ||^, and sfudy how worse fhe configurafion 
can perform, compared to the optimal configuration tt*. 


Proposition 5. For look-ahead window size T, suppose that 
we can solve Q with competitive ratio T > 1, the upper bound 
on the cost difference (while taking the competitive ratio T into 
account) from configurations iVp and tt* is given by 


lim 




T 

J. fyj 




max A 


(f)^ ^ (r + l)F(T)+a 


T 


(19) 

Proof. See Appendix]^ □ 

We assume in the following fhaf fhe compefifive rafio 
r is independenf of fhe choice of T, and regard if as a 


given paramefer in the problem of finding opfimal T. 
This assumpfion is jusfified for several cosf funcfions 
where fhere exisf a uniform bound on fhe compefifive 


rafio for arbitrarily many services (see Sections 4.2.5 
4.2.7|. We define fhe optimal look-ahead window size as fhe 


solufion fo fhe following opfimization problem: 


min 

T 

S.f. 


(r-il)F(T)+a 

T 

T > 1 


( 20 ) 


Considering fhe original objecfive in ||^, fhe problem 
I p0| | can be regarded as finding fhe opfimal look-ahead 
window size such fhaf an upper bound of fhe objecfive 
funcfion in is minimized (according fo Proposifion 

t . The solufion fo ( [20) is fhe opfimal window size fo 
ok-ahead so fhaf (in fhe worsf case) fhe cosf is closesf 
fo fhe cosf of fhe opfimal configurafion tt*. 


5.2 Characteristics of the Probiem in po) 

We now study the characteristics of <|20|. To help 
with the analysis, we interchangeably use variable T to 
represent either a discrete or a continuous variable. We 
define a continuous convex function G{T), where T > 1 
is a continuous variable. The function G{T) is defined in 
such a way that G{T) = F{T) for all the discrete values 
T s {1,2,...}, i.e., G{T) is a continuous time extension of 
F{T). Such a definition is always possible by cormecting 
the discrete points in F{T). Note that we do not assume 
the continuity of the derivatives of G{T), which means 
that may be non-continuous and ^ 2 '^ may have 
-icxD values. However, these do not affect our analysis 
below. We will work with continuous values of T in 
some parts and will discretize it when appropriate. 

We define a function 9{T) = fT+J-)^ . (^) . + . ?' , to represent 
the objective function in |20| after replacing F(T) with 
G{T), where T is regardedas a continuous variable. We 
take the logarithm of 9(T), yielding 

ln6» = ln((r-hl)G(r)-hcr)-lnT (21) 

Taking the derivative of In 0, we have 

df^ ^ (T + l)^ _ 1 

dT (T + l)G{T) + a T ^ ’ 

We set ^ 2 ) equal to zero, and rearrange the equation, 
yielding 

$(r) ^{T + - (r -f l)G(T) -a = 0 (23) 

dT 

We have the following proposition and its corollary, 
their proofs are given in Appendix 

Proposition 6. Let Tq denote a solution to ( [Z^ , if the 
solution exists, then the optimal look-ahead window size T* 
for problem is either [TqJ or [Tg], where [xj and [x] 
respectively dmote the floor (rounding down to integer) and 
ceiling (rounding up to integer) of x . 

Corollary 1. For window sizes T and T -y I, if 9{T) < 
9{T 1), then the optimal size T* < T; if 9{T) > 9(T 1), 

then T* >T+ 1; if 9{T) = 9{T + 1), then T* = T. 
















12 


Algorithm 4 Binary search for finding opfimal window 
size_ 

1: Initialize variables T_ <— 1 and r+ <— Tm 
2: repeat 

3: T^y{T.+T+)/2\ 

4: if 6»(T) < e(r + 1) then 

5: T+ ^ T 

6: else if 0(T) > d{T + 1) then 

7: T_ <- T + 1 

8: else if 0(T) = 0(T + 1) then 

9: return T //Optimum found 

10: end if 

11: until T_ = T+ 

12: return T_ 


5.3 Finding the Optimai Soiution 

According fo Proposifion we can solve < [2^ fo 
find fhe opfimal look-ahead window size. When G{T) 
(and F(T)) can be expressed in some specific analyfical 
forms, fhe solufion fo < [23) can be found analytically. For 
example, consider G(Tj = F{T) = j3T°‘, where /3 > 0 


and 


> 1. In fhis case, Tq = 


((r+l)^(a-l)) 


and 


(r+l)/3(a-l)^ 

T* = 0 (T). One can also use such 

specific forms as an upper bound for a general function. 

When G{T) (and F[T)) have more general forms, 
we can perform a search on fhe opfimal window size 


according fo fhe properfies discussed in Secfion 5.2 
Because we do nof know fhe convexify of 6{T) or $(r), 
sfandard numerical mefhods for solving ||M I or ( [23) may 
nof be efficienf. However, from Corollary ^ we know 
fhaf fhe local minimum of 9{T) is fhe globm minimum, 
so we can develop algorifhms fhaf use fhis properfy. 

The opfimal window size T* fakes discrefe values, so 
we can perform a discrefe search on T e {1,2,..., T^}, 
where Tm > 1 is a pre-specified upper limif on fhe 
search range. We fhen compare 9{T) wifh 9(T -(-1 ) and 
defermine fhe opfimal solufion according fo Corollary [^ 
One possible ^proach is fo use binary search, as shown 
in Algorifhmffl which has fime-complexify of O (log Tm)- 

Remark: The exacf value of T may be difficulf fo find 
in pracfice, and ( [T^ is an upper bound which may have 
a gap from fhe acfual value of fhe leff hand-side of 
( |T^ . Therefore, in pracfice, we can regard T as a funing 
paramefer, which can be funed so fhaf fhe resulting 
window size T* yields good performance. For a similar 
reason, fhe paramefer a can also be regarded as a funing 
paramefer in pracfice. 

6 Simulation Results 

In fhe simulafions, we assume fhaf fhere exisf a back¬ 
end cloud (wifh index feg) and mulfiple MMCs. A service 
insfance can be placed eifher on one of fhe MMCs or on 
fhe backend cloud. We firsf define 

ify<Y 
-|-(X), if y >Y 


Riy) = 


(24) 


where Y denofes fhe capacify of a single MMC. Then, 
we define fhe local and migration cosfs as in ||^, ||^, wifh 


\yk{t)R{yk{t)) + grk{t), if k ^ ko 


(25) 


Wki,t(yk(t- l),yi(t),zki(t)) 

A J hzki(t), if k = ko or/and I = ko 

jzfci(f) {R{ykit)) -b R(yi{t))) + hski(t), else 

(26) 

where yk{t) and Zki{t) are sum resource consumptions 
defined as in Secfion 4.2.1 rk{t) is fhe sum of fhe 


disfances befween each insfance running on cloud k 
and all users connecfed fo fhis insfance, Ski{t) is fhe 
disfance befween clouds k and I multiplied by fhe 
number migrafed insfances from cloud k fo cloud I, and 
g,g,h,h are simulation paramefers (specified lafer). The 
distance here is expressed as fhe number of hops on fhe 
communicafion nefwork. 


Similar fo Secfion 4.2.7 (buf wifh migrafion cosf here), 
we consider fhe scenario where fhe connecfion sfafus fo 
fhe backend cloud remains relafively unchanged. Thus, 
in (^ and ( j^ , Uk,t{-) and Wki,t(-, ■, •) are linear in yk{t) 
andzfc;(f) vmen involving fhe backend cloud ko. When 
nof involving fhe backend cloud, fhe cosf functions have 
fwo ferms. The firsf ferm confains i?( ) and is relafed 
fo fhe queuing delay of dafa processing/fransmission, 
because R(-) has a similar form as fhe average queueing 
delay expression from queueing fheory. The addifional 
coefficienf yk{t) or Zki{t) scales fhe delay by fhe fofal 
amounf of workload so fhaf experiences of all insfances 
(hosfed af a cloud or being migrafed) are considered. 
This expression is also a widely used objective (such as 
in 0 ) for pushing fhe sysfem towards a load-balanced 
sfafe. The second ferm has fhe disfance of dafa frans¬ 
mission or migrafion, which is relafed fo propagafion 
delay. Thus, bofh queueing and propagafion delays are 
capfured in fhe cosf definition above. 

Nofe fhaf fhe above defined cosf functions are hef- 
erogeneous, because fhe cosf definitions are differenf 
depending on whefher fhe backend cloud is involved 
or nof. Therefore, we cannof direcfly apply fhe exisfing 
MDP-based approaches p0)-p2] fo solve fhis problem. 
We consider users continuously connecfed fo service 
insfances, so we also cannof apply fhe fechnique in 


6.1 Synthetic Arrivals and Departures 

To evaluate how much worse the online placement 
(presented in Section [^ performs compared to the opti¬ 
mal offline placement (presented in Section j^, we first 
consider a setting with S 5 mthetic instance arrivals and 
departures. For simplicity, we ignore the migration cost 
and set 5 = 0 to make the local cost independent of 
the distance rfc(t). We set Y = 5, 5 = 3, and the total 
number of clouds K = b among which one is the 
backend cloud. We simulate 4000 arrivals, where the 
local resource consumption of each arrival is uniformly 
distributed within interval [0.5,1-5]. Before a new in¬ 
stance arrives, we generate a random variable H that 
is uniformly distributed within [0,1]. If iF < 0.1, one 
randomly selected instance that is currently running in 
the system (if any) departs. We only focus on the cost in 
a single timeslot and assume that arrival and departure 
events happen within this slot. The online placement 














13 




(a) (b) 

Figure 5. Results with synthetic traces: (a) objective func¬ 
tion value, (b) average performance ratio. 


greedily places each instance, while the offline placement 
considers all instances as an entirety. We compare the 
cost of fhe proposed online placemen! algorifhm wifh 
a lower bound of fhe cosf of fhe opfimal placement. 
The optimal lower bound is obtained by solving an 
optimization problem that allows every instance to be 
arbitrarily split across multiple clouds, in which case the 
problem becomes a convex optimization problem due to 
the relaxation of infeger consfrainfs. 

The simulafion is run wifh 100 differenf random 
seeds. Fig. shows fhe overall resulfs. We see fhaf 
fhe cosf is convex increasing when fhe number of ar¬ 
rived insfances is small, and it increases linearly when 
the number of insfances becomes large, because in fhe 
laffer case, fhe MMCs are close fo being overloaded 
and mosf insfances are placed af fhe backend cloud. 
The facf fhaf fhe average performance rafio (defined as 


mean 


^mean converges wifh increas¬ 

ing number of insfances in Fig. |^b) supporfs our anal¬ 


ysis in Secfion 4.2.7 


6.2 Real-World Traces 

To furfher evaluafe fhe performance while considering 
fhe impacf of predicfion errors and look-ahead window 
size, we perform simulafions using real-world San Fran¬ 
cisco faxi fraces obfained on fhe day of May 31, 2008 
[ [3H , | [3^ . Similar fo [T^ , we assume fhaf fhe MMCs are 
deployed according to a hexagonal cellular structure in 
the central area of San Francisco (fhe cenfer of fhis area 
has lafifude 37.762 and longifude —122.43). The disfance 
befween fhe cenfer poinfs of adjacenf cells is 1000 m. 
We consider K = %1 cells (fhus MMCs), one backend 
cloud, and 50 users (faxis) in fofal and nof all fhe users 
are acfive af a given fime. A user is considered active if 
ifs mosf recenf location update was received within 600 s 
from fhe currenf fime and ifs locafion is wifhin fhe area 
covered by MMCs. Each user may require af mosf one 
service af a fime from fhe cloud when if is acfive, where 
fhe durafion fhaf each acfive user requires (or, does nof 
require) service is exponentially disfribufed wifh a mean 
value of 50 slofs (or, 10 slofs). When a user requires 
service, we assume fhaf fhere is a service insfance for 
fhis parficular requesf (independenf from ofher users) 
running on one of fhe clouds. The local and migration 


(if migration occurs) resource consumpfions of each such 
insfance are sef fo 1. We assume fhaf fhe online algorifhm 
has no knowledge on fhe deparfure fime of insfances 
and sef Tme = oo for all insfances. Nofe fhaf fhe faxi 
locafions in fhe dafasef are unevenly disfribufed, so if 
is sfill possible fhaf one MMC hosfs mulfiple services 
alfhough fhe maximum possible number of insfances 
(50) is smaller fhan fhe number of MMCs (91). The 
disfance mefric (for evaluafing rk(t) and Ski.{t)) is defined 
as fhe minimum number of hops befween fwo locafions 
on fhe cellular sfrucfure. The physical fime correspond¬ 
ing fo each slof is sef fo 60 s. We sef fhe paramefers 
r = 1.5,cr = 2,Y = 5,g = h = 3,g = h = 0.2. The cosf 
predicfion error is assumed fo have an upp er bound in 
fhe form of F(T) = I3T°‘ (see Secfion |5.3[ , where we 
fix a = 1.1. The predicfion error is generafed randomly 
while ensuring fhaf fhe upper bound is safisfied. 

The simulafion resulfs are shown in Fig. 1^ In Fig. 
we can see fhaf fhe resulf of fhe proposed online 
placement approach (E) performs close fo fhe case of 
online placemen! wifh precise fufure knowledge (D), 
where approach D assumes fhaf all fhe fufure cosfs 
as well as insfance arrival and deparfure fimes are 
precisely known, buf we sfill use fhe online algorifhm 
fo defermine fhe placement (i.e., we greedily place each 
instance), because the offline algorifhm is foo fime con¬ 
suming due fo ifs high complexify. The proposed mefhod 
E also outperforms alfernafive mefhods including only 
placing on MMCs and never migrafe fhe service in¬ 
sfance affer inifialization (A), always following fhe user 
when fhe user moves fo a differenf cell (B), as well 
as always placing fhe service insfance on fhe backend 
cloud (C). In approaches A and B, fhe insfance placemen! 
is defermined greedily so fhaf fhe disfance befween 
fhe insfance and ifs corresponding user is fhe shorfesf, 
subjecf fo fhe MMC capacify consfrainf Y so fhaf fhe 
cosfs are finite (see @). The fluctuation of fhe cosf 
during fhe day is because of differenf number of users 
fhat require fhe service (fhus different system load). In 
Eig. I^b), we show the average cost over the day with 
different look-ahead window sizes and f3 values (a large 
/3 indicates a large prediction error), where the average 
results from 8 differenf random seeds are shown. We 
see fhaf fhe opfimal window size (T*) found from fhe 
mefhod proposed in Secfion is close fo fhe window 
size fhaf brings fhe lowesf cosf, which implies fhaf fhe 
proposed mefhod for finding T* is reasonably accurafe. 

Additional resulfs on fhe amounf of compufafion fime 
and floafing-poinf operafions (ELOP) for fhe resulfs in 
Eig. I^a) are given in Appendix 

7 Conclusions 

In fhis paper, we have sfudied fhe d 5 mamic service 
placemen! problem for MMCs wifh mulfiple service 
insfances, where fhe fufure cosfs are predicfable within a 
known accuracy. We have proposed algorithms for bofh 
offline and online placemenfs, as well as a mefhod for 
finding fhe opfimal look-ahead window size. The perfor¬ 
mance of fhe proposed algorifhms has been evaluafed 
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(a) (b) 

Figure 6. Results with real-world traces (where the costs 
are summed over all clouds, i.e., the A{t) values): (a) 
Actual costs at different time of a day, where (3 = 0.4 for 
the proposed method E. The arrows point to the average 
values over the whole day of the corresponding policy, (b) 
Actual costs averaged over the whole day. 

both analytically and using simulations with S 5 mthetic 
instance arrival/departure traces and real-world user 
mobility traces of San Francisco taxis. The simulation 
results support our analysis. 

Our results are based on a general cost function that 
can represent different aspects in practice. As long as 
one can assign a cost for every possible configuration, 
fhe proposed algorifhms are applicable, and fhe fime- 
complexify resulfs hold. The opfimalify gap for fhe on¬ 
line algorifhm has been analyzed for a narrower class of 
functions, which is still very general as discussed earlier 
in fhe paper. 

The fheorefical framework used for analyzing fhe 
performance of online placement can be extended to 
incorporate more general cases, such as those where 
there exist multiple t 5 rpes of resources in each cloud. We 
envision thaf fhe performance resulfs are similar. We also 
nofe fhaf our framework can be applied fo analyzing a 
large class of online resource allocafion problems fhaf 
have convex objecfive funcfions. 
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Appendix A 

Summary of Notations 

The main notations used in this paper are summarized 
in Table U 


Appendix C 

Proof of Proposition]^ 

We first introduce a few lemmas, wifh resulfs used 
lafer in fhe proof. 


Appendix B 

Proof of Proposition]^ 

We show fhaf problem ||^ can be reduced from fhe 
parfifion problem, which is known fo be NP-complefe 
Corollary 15.28]. The partifion problem is defined 
as follows. 


Definition 1. (Partition Problem) Given positive integers 

vi,V 2 , is there a subset S C {1,2,...,M} such that 

the complement set of S? 


Similarly to the proof of |29 Theorem 18.1], we define 
a decision version of fhe bin packing problem, where we 
assume fhat fhere are M ifems each wifh size 


A 

CLi — 


2v, 




for all i G {1,2,..., M}, and fhe problem is fo defermine 
whefher fhese M ifems can be packed info two bins each 
with unit size (i.e., its size is equal to one). It is obvious 
that this bin packing decision problem is equivalent to 
the partition problem. 

To solve the above defined bin packing decision prob¬ 
lem, we can sef to = t, T = 1, and K = 2 in Because 
we affempf fo place all ifems, we sef = {1,2} for all 
i. By definition, — 0 for t — 1. We omif fhe 

subscripf t in the following as we only consider a single 
slof. We define atxk = Oi for all A, k, and define 


Uk{y) 


ey, if y < 1 

^{y-l)+€, ify>l 


(27) 


where c = -, and e > 0 is an arbifrary consfanf. 

Because Vi is a positive infeger for any i, we have fhaf 
^ = 2vi is always a positive integer, and f Vj 

is also always a positive integer. It follows fhaf y can 
only be infeger multiples of c (where we recall fhaf y 
is fhe sum of for fhose ifems i fhaf are placed in fhe 
bin), and fhere exisfs a posifive infeger d = r'i 

such fhaf c'c = 1. Thus, when y > 1, we always have 
y—1 > c. Therefore, fhe choice of Uk{y) in ^7\ guaranfees 
fhaf Uk{yk) > 3e > 2e whenever bin k (k G {T, 2}) exceeds 
ifs size, and X]fe=i when no bin has exceeded 

ifs size. Af fhe same time, Uk(y) satisfies Assumption]^ 
as long as c < 2. 

B^ fhe definition of c, we always have c < 2 because 
r -7 > 1- To solve fhe bin packing decision problem 
defined above (fhus fhe partition problem), we can solve 
© with the above definitions. If fhe solution is nof 
larger fhan 2e, fhe packing is feasible and fhe answer 
fo fhe parfifion problem is "yes"; ofherwise, fhe packing 
is infeasible and fhe answer fo fhe parfifion problem is 
"no". If follows fhaf problem (j^ is "af leasf as hard as" 
fhe parfifion problem, which proves fhaf l|^ is NP-hard. 


Lemma 2. For any instance j and configuration sequence A, 
we have 


dD 


dx 


!>■ 


(x) = Vy,zD(y,z) • (ajA,bjA) 


(28) 


Proof. 


dD 




dx 


iA 


w = E 


t^to 
K K 


K 


Y (x). (x) + 


EEaTTTTT 


/=1 

tQ-\-T—l 

= E 

t^to 
K K 

EE 


dzkiit) 


iA 


K 

E 

fc=i 


dD 


dyk(t) 


(x) • ajxk{t)+ 


dD 


(x) • bjxkiit) 


^ ^ dzki (t) 

= Vy,^i5(y,z) • (a^A.bjA) 

where we recall thaf yk{t) and Zki{t) are functions of Xjx 
for all j and A, thus they are also functions of vector 

X. □ 

Lemma 3. For any instance j and configuration sequence A, 
we have 


VxD (x) • X = Vy,zD (y, z) • (y, z) 


(29) 


Proof. 


\7^D (x) • X 

aga ^ 

M 

= E E (y’ ■ (^jA, hjx) ■ xjx 

j = l aga 

( M 

E E (ajA.bjA) -XjX 

j=l aga 

= Vy,zD(y,z) • (y,z) 

where fhe second sfep follows from Lemma fhe lasf 
sfep follows from fhe definition of vecfors y, z, a^x, bjx- 

□ 


We infroduce some addifional nofafions fhat are used 
in the proof below. Recall fhaf fhe values of vecfors x, 
y, and z may vary over fime due fo service arrivals and 
deparfures. Lef x^'’^ yj^\ and zj'^^ respectively denofe fhe 
values of x, y, and z immediately after insfance j is placed; 
and lef yj'L^, and z^{^^ respecfively denofe fhe val¬ 
ues of X, y, and z immediately before insfance j is placed. 
We note fhaf fhe values of x, y, and z may change after 
placing each insfance. Therefore, fhe nofions of "before". 
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Table 1 

Summary of main notations 

Notation 

Description 

_A 

Defined to be equal to 

Dot-product 

K 

k,l £ {1,2,..., K} 

t 

T 

M 

Total number of clouds 

Cloud index 

Timeslot index 

Size of look-ahead window 

Total/maximum number of service instances under consideration 

i 

^(fo, • ■■, fn ) 

U{t,n{t)) 

^7r(t-l,p(f) 

€{t) 

Service instance index 

Configuration matrix for slots {to, written in tt for short 

Local cost in slot t 

Migration cost between slots t — 1 and t 

Sum of local and migration costs in slot t when following configuration nit — 1, t) 

Actual cost (actual value of C'„(t_i,t)(t)) 

Predicted cost (predicted value of C'„(t_i^t)(t) when prediction is made at slot to) 

Equal to rna,XTr(t-i,t),to + t) — _i.~^{to + t)^, the maximum error when 

looking ahead for r slots 

A 

A £ A 

Set of all possible configuration sequences 

Configuration sequence (when considering a particular instance i, it is equal to the ith 
column of tt) 

Ai c A 

Subset of configuration sequences that conform to the arrival and departure times of 
instance i 

XiX 

^iXk (^) 

Binary variable specifying whether instance i operates in configuration sequence A 

Local resource consumption at cloud k in slot t when instance i is operating under 
configuration sequence A 

biXkl{t) 

Migration resource consumption when instance i operating under configuration sequence 

A is assigned to cloud k in slot t — 1 and to cloud 1 in slot t 

yk{t) 

Zki{t) 

Equal to 'l2xeA'^i^kit)xix, sum local resource consumption at cloud k 

Equal to J2iLi'l2xeA^i^ki(t)xix, sum migration resource consumption from cloud k to 
cloud 1 

Uk,t (yk(t)) 

Wkut iVkit- l),yi{t),Zki{t)) 

is)h^h^ (or 

y 

z 

Local cost at cloud k in timeslot t 

Migration cost from cloud k to cloud 1 between slots t — 1 and t 

The {hi, /i 2 )th (or {hi, h 2 , /isjth) element in an arbitrary vector or matrix g 

Vector with elements (y)j,j = yk{t) 

Vector with elements = Zki{t) 

X 

Vector with elements = Xix 

SiiX 

hix 

D{x), 5(y,z) 

(f), %!) 

r 

Vector with elements {aix)f^t = aixk{t) 

Vector with elements {hix)f,n = hixki{t) 

Sum (predicted) cost of all T slots, defined in 

Parameters related to the performance gap, defined in |ll| and 1 12 

Competitive ratio of Algorithm 

a 

F{T) 

G{T) 

e{T) 

Parameter related to the migration cost, defined in |l7| 

Equal to e(f — to), the sum-error starting from slot to up to slot to -1- T — 1 

The continuous time extension of F{T), see Section [s^ 

Equal to (r-n)G(T)+o-, upper bound in > 19 i after replacing F{T) with G{T) 


Note: The timeslot argument t may be omitted in some parts of the discussion tor simplicity. Vector elements are referred to 

with multiple indexes, but we regard vectors as single-indexed vectors for the purposes of vector concatenation (i.e., joining 

two vectors into one vector) and gradient computation. 
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"after", and "time" (used below) here correspond to the 
sequence of service instance placement, instead of the 
actual physical time. 

We then introduce vectors that only consider the 
placement up to the jth service instance, which are 
necessary because the proof below uses an iterative 
approach. Let x^, yj, and zj respectively denote the 
values of x, y, and z at any time after placing instance 
j (where instance j can be either still rurming in the 
system or already departed) while ignoring the placement 
of any subsequent instances j' > j (if any). This means, 
in vector kj, we set (xj)._^ = Xix for any i < j and 
A, and set = 0 for any i > j and A, although 

the value of Xi\ at the current time of interest may be 
non-zero for some i > j and A. Similarly, in vectors 
and Zj, we only consider the resource consumptions up 
to instance j, i.e., (y^O^t = LLi and 

i'^j)kit - ZLi ZacA biXki{t)xix for any k, I, and t. 

We assume that the last service instance that has 
arrived before the current time of interest has index M, 
thus X = xm, y = Ym, and z = zm- 

Because an instance will never come back after it has 
departed (even if an instance of the same type comes 
back, it will be given a new index), we have y^-i < 
yj'^i and Zj-l < where the inequalities are defined 
element-wise for ^e vecton _ , . 

Define Vj = D — D to denote 

the increase in the sum cost D{y,z) (or, equivalently, 
D(x)) at the time when placing^service j. Note that after 
this placement, the value of D{yj,Zj) — D(yj_i,Zj_i) 
may vary over time, because some services i < j may 
leave the system, but the value of Vj is only taken when 
service j is placed upon its arrival. 

Lemma 4. When Assumption is satisfied, for any M, we 
have 

M 

D (xm) < ^ Vj (30) 

i=i 

Proof. Assume that service j takes configuration Aq af¬ 
ter its placement (and before it possibly unpredictably 
departs), then y]-^^ - y^^}^ = a^Xo and z^^^ - z’f}^ = 
bjAo- For any time after placing instance j we define 
Ayj = yj — yj-i and Azj = Zj — zj-i- We always 
have Ayj = ajx„, Azj = b^Ao, if instance j has not yet 
departed from the system, and Ay^ = Azj — 0 it j has 
already departed from the system. 

Noting that D (yj,Zj) is convex non-decreasing (from 
Lemma [^, we have 


D{yj, 

Zj) - 

- D {yj- 

-i,Zj-i) 



= D {yj-i 

+ ^yj> 

Zj_i + Azj) - 

-D(yj_i,Zj_i) 


<D{ 


+ Ay^i 

zf_^ + Azj) 

-5(y“.,z®.) 

(31) 

<b{ 

fA 

+ ^jAo 1 

+bjAo) 

-5{y”_..z»_,) 

(32) 

= b{ 

' U) 

A 3 ’ 

A) - 


) 



where inequality < |3l| is because y^-i < ^j-i < 

'^91 1 (see discussion above) and due to the convex 


non-decreasing property of D{yj,Zj); inequality (32 i is 
because Ayj < ajXg, Azj < bj^o arid also due to the 
non-decreasing property of D {yj,Zj). 

We now note that D (xq) = 0, where xq = 0 and 0 is 
defined as a vector with all zeros, thus yo = zq = 0. We 
have 

M M 

H^ H (yj ’^ (yj-i:zj-i) 


i=i 


i=i 


= D (xm) - D (xo) = D (xm) 


□ 


Lemma 5. When Assumption is satisfied, for any j and A, 
we have 

Vj < (xAf) (33) 

OXjX 

where f is a constant satisfying ©■ 

Proof. Assume that service j takes configuration Aq af¬ 
ter its placement (and before it possibly unpredictably 
departs). Because we perform a greedy assignment in 
Algorithm]^ we have 

= 5 (y“i + “)»«• 4-1 + *>)»•) - ^(yj-i’4-i) 

< 5 (y<i>, + a„. + b„) - 5 (y®,, a®,) 

for any A S A^. 

Then, we have 


D 


(y?l 


ajA, 


+ b,A)-D(y«„z?2,) 


< (y^li + ajx, + bjA) ■ (a^A, B^a) (34) 


< Vy,zD(y 

max + ; Zjnax + hjx) ■ iajx,hjx) (35) 

< </>Vy,zi5 (yM, zm) • (ajA, hjx) (36) 

, dD 


(xm) 


(37) 


dxjx 

where "•" denotes the dot-product. The above relation¬ 
ship is explained as follows. Inequality ( [34| follows from 
the first-order conditions of convex functions Section 

3.1.3]. The definition of ymax and z^ax in Proposition 
1^ gives (35 1 . The definition of f in ( [TT) gives ( [36) . 
Equality (37 1 follows from Lemma [^ This completes the 
proof. □ 


Using the above lemmas, we now proof Proposition [^ 


Proof. (Proposition|^ Due to the convexity of D(x), from 
the first-order conditions of convex functions Section 








20 


3.1.3], we have 
5((/)'!/'Xm) - 5(xm) 

> Vx£> (xm) • - Xm) 

= (l)ipV^D (xm) • X^ - Vxi5 (xm) Xm 


M 


dD 


= EE - (xm) - Vx£> (xm) • XM 

•_1 -V A CfXiX 


2=1 AgA 
M 


^2=1 AeA 


2 A 


ijj 


M 


\2=1 AgA 

( M 

E E 

i=l AgA 


V' 


M 


Vxil ( 5 ^) • Xm 


Eq- <|l3) = -lA XI “ 

\i=l 

- ~ D{xm)^ 


> 0 


l(2/) = E 

p=0 

n 

^(2/) = E 

p=0 






lim „ , . 

y ^+00 -2(y) w. 

Proof. When 11 = 0, we have 
lim 

v ^+00 ~2(y) uj. 


m 


( 0 ) 


When n > 0, we note that limy_>+oo '^i{y) — +00 and 


limy_i.+oo ^ 2 ( 2 /) = + 00 , because > 0 and > 0. 




(38) 

(39) 

(40) 


We apply the L'Hospital's rule and get 

^ 1 ( 2 /) 


lim _ , , 

y ^+00 ^ 2 ( 2 /) 


= lim 


rfSi(y) 

dy 


y —>-+oo *^‘^2 (y) 
dy 


/ I ( \ (xm) • Xm I - ., 

= V' ( X E (xm)-- I (41) 


= lim 

p—>+00 


Ep=ip4'’^2/ 


(p)„,p-i 


(p),,p-i 


(47) 


where x*^ is the {i, A)th element of vector x’fj. From 
Lemma we have 


Suppose we have 
"-i{y) 


lim „ 

p-).+oo ^2[y) 


= lim 


EE {Ul-=Vp-m))u:[^^y^- 


p—n 


eE (nr=o(p - m)) 


Eq.@>^(XE---.-EE^E^l (42) 


(48) 


V' 




From the constraint EagA ~ ^ ^^*4 the definition of 
f, we gef 


(44) 


(45) 

(46) 


which equals fo (471 for n = 1. For 1 < n < 12, we nofe 
fhaf 12 — n > 0, hence fhe numerafor and denominator 
in fhe righf hand-side (RHS) of ( [4^ sfill respecfively 
approach -l-oo when y -l-oo (because > 0 and 
UJ 2 ^ > 0). Lef 'l'(n) denote fhe RHS (481, we can reapply 
fhe L'Hospifal's rule on 4'(n), yielding 

(n 


4'(n) = lim 


P— 


p—n-\-l 


r(n+l)-l 

lm=0 




2/—F + OO 


E f2 

p—n-\-l 

= 4'(n -I- 1) 

which proofs fhaf ( [48) holds for 1 < n < 12. Therefore, 

,(f2) 


(n 


(n+l)-l 

m—0 


[p — m)J uj!f^yP~('^+^') 


where fhe lasf equalify follows from Lemma [^ This gives 

Equafion ( jlO) follows from fhe facf fhaf yk,j{t) and 
Zki,j{t) are both linear in Xi\. 

The last equality in ( [T^ follows from Lemmaj^and the 
fact that D {x) = D (y, z) as well as x = xm, y = Ym, 
and z = zm- D 

Appendix D 

Proof of Proposition]^ 

Lemma 6. For polynomial functions Si(y) and 22 ( 2 /) in the 
general form: 


lim ih( = 'Kn) = X! 

p^+00 -2(2/) p!w2^ 


UJ. 


(O) 


□ 


Lemma 7. For variables 0 <y <y ',0 <yk <yk ,0 <yi < 
2/0 0 < Zki < we always have 


duk t j \ ^ duk t 

-[y)< 


(2/') 


(49) 


dy dy 

dwki,t, , , dwki,t , , , , , 

-7^{yk,yi,zki) < —^{yk^Vi^Zki) (50) 

where T stands for either yk, yi, or Zki- 

Proof. We note that 

duk,t, s sr^ (p) P-1 

^iy) = 2 ^prk,y 


from which (49' follows directly because jlfl > 0. We 
then note that 


where the constants > 0 and ujlf'^ > 0 for 0 < p < 12, 
while uj^'^ > 0 and uj^ > 0, we have 

"i{v) 


dw 


dyk 


''^-{yk:yuzki) = EEE p^'^ki"f^''''’'’vk 


Pi Pi P3 


0. Similarly, (511 


from which (50 1 follows for T = yk because k 


(Pl,P2.P3) 
kl.t 


> 


also follows for T = yi and T = zu- □ 


Lemma 8. Let 12 denote the maximum value of p such that 
either > 0 or where pi + P 2 + P 3 = P- 

Assume that the cost functions are defined as in and 
then for any constants S > 0, B > 0, there exist sufficiently 
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large values of y, yu, yuZki, such that 

duk 


dwk 


dy 

d«fc , 


iy) 

dy (y)-y 

uk,t{y) 

ax * (yk B,yi + B, Zki + B) 

dwki 


ax {.ykTUii Zki) 
dwk 


dT 


iyk,yi,Zki) ■ T 


<1 + 5 


<11 + 5 


Wk,t{yk,yuzki) 
for any k,l,t, where T stands for either yt^, yi, or zu- 


dui. 


^^(y + B) _E%^Pilf,Uy + B) 


jp) 


xp-l 


duu 


dy 


'-{y) 


Y:Upii^}y^-^ 

T.%iPik}y^ 


d^kp y „^,(p) 


dy 


ukAy) Y.%iiSyp 

According to Lemma we have 

‘^{y + B) 


lim 

y—f+oo 


duu 


lim 

y—^-\-oo 


dy 

duk,t 

dy 


Ky) 

{y) ■ y 


UkAy) 


^aP!P 


= 1 


= n' 


Pi P2 P3 


= l^A')kAr 

p=i 

where the constant 

^/^(p) A 


/\(p) 


(,^'\yp> A \ ' ^(P1.P2,P3)xP1XP2AP3 

)kl,t — ^kl,t ‘il S2 S3 


lim 

q—>-\-oo 


dq 
dwki, 




dq 


A) 


^'aS' 


= 1 


lim 

q—>-+oo 


dq 


-{q)-q 


wkAq) 


<1 + 5 (51) 

<11 + 5 (52) 


7f;^ 


= n" 


(61) 


where B’ = . ^ . . 

min{Ci;C2;C3} 

According to the definition of limifs, for any 5 > 0, 
fhere exisf sufficienfly large values of y and q (fhus 
yk,yi,Zki), such fhaf 


(53) 


(54) 


"PPPA + B) 


du 


dy 

du^t 


iy) 

dy iy)-y 
UkAy) 


Proof. Lef H' denofe fhe maximum value of p such fhat 
ypl > 0, we always have H' < H. We nofe fhaf 


dq 
dwki 


dq 

dwki 


dq 


Aq) 

-{q)-q 


WkAq) 


<1 + 5 


< ll' + 5 


<1 + 5 


< H" + 5 


(62) 

(63) 

(64) 

(65) 


(55) 


(56) 


(57) 


(58) 


for any k, I, t. 
Because 


dwh 


TA + B') ’^{qPB') 

(q) 


dwki, 

dq 


iq) 


dwki.t 

d(C<?) 


dwki,t, . dwki,t, . ^ 

for any C > 0/ we can also express fhe bounds < [M) and 
in ferms of yk,yi,Zki, yielding 

diBki t 

’ 111 ^ -t— ( 1 / > 111 —t- I o I > /, 7, j -t— 1 o / > I 


dT 


' (j/fc + CiBiyi + ABf Zki + AB') 


dwki,-, 

dT 


iyk,yi,zki) 


< 1 


( 66 ) 


where we nofe fhaf affer expanding fhe numerafor in 
fhe RHS of 1551, fhe consfanf B does nof appear in fhe 
coefficienf of 

Now, define a variable g > 0, and we lef yk = Ci9j yi = 
C 29 , Zki = Cs?/ where Cij C2, Ca > 0 are arbifrary consfanfs. 
Using Ci;C2,C3/ q, we can represenf any value of 
{yk,yi,Zki) > 0. Wifh fhis definifion, we have 

wki,t (q) = wki,t {Ciq, Aq, Csq) 

= EEE ^(pi .P2 ,P3 ) ^Pl ^P2 ^P3 gPl +P2 +P3 


dwki, 

dT 


{yk,yi,zki) ■ T 


, , <ll" + 5 (67) 

wkAyk,yi,zki) 

where T sfands for eifher yk, yi, or zu- According fo fhe 
definition of B', we have B < (iB', B < AB', B < AB'■ 
From Lemma we have 

Z {yk + B,yi + B, Zki + B) 


< 


dT 
dwki.t 


dT 


{yk + AB',yi + AB', Zki + AB') (68) 


Combining | [68) wifh 166 1 and nofing fhaf U' < H and 
U" < n, fogefher wifh (62\ , { 631, and ( |67| , we gef 

□ 


(59) 


{(Pl,P2.P3):Pl+P2+P3=P} 

and n" is defined as fhe maximum value of p such fhaf 
{n')pP > 0, we always have U" < H. Nofe fhaf 1591 is in 
fhe same form as (T^ . Following fhe same procedure as 
for obfaining < [57| and < [58| l, we gef 

^^B^{q + B') 


Lemma 9. For arbitrary values > 0 and d 2 ,n > 0 for 
all n = 1,2, where und 'd 2 ,n ure either both zero 
or both non-zero and there exists n such that und 1 ) 2 ,n 
are non-zero, if the following bound is satisfied: 

A, 

max —— 

{nG{l,...,N}:'di^ri^0,'d2,n^0} 'i/2 ^ 


< o 


(60) 


then we have 


for any a;„ > 0. 


J2n=l Wndl^n 

J2P=1 WnA.n 


< 0 
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Proof. Because for all n, we have 

N N 

n—1 n—1 

yielding the result. □ 

Lemma 10. When Assumption is satisfied and the window 
size T is a constant, there exists a constant B >0 such that 

{ymax + SiiX, 2,,„ax + L^a) “ (y, z) < Be (69) 
for any i and any A G A^, where e = [1,1] is a vector of 
all ones that has the same dimension as (y, z). 

Proof. We note that 

(ymax “1" ^iA;Zn\ax 9" hix) (y^z) 

— (ymax “t” ^max^yi Z^ax ~t” ^max^z )-(y.z) (70) 

(^max {BdT + l)ey, 

^max {BdT+l)e^) (71) 

< max {omax (BdT + 1); bmax {BdT + 1)} • e (72) 

where ey = and are vectors of 

all ones that respectively have the same dimensions as 
y and z. Inequality ( [70) follows from fhe boundedness 
assumption in Assumption Inequality < [7T) follows by 
nofing fhaf fhe gap befween (ymax, Zmaxjand (y,z) is 
because of insfances unpredicfably leaving fhe sysfem 
before fheir maximum lifetime, and fhaf fhere are af 
mosf T slofs, af mosf Bd insfances unpredicfably leave 
fhe sysfem in each slof (according fo Assumption |^. 
Inequalify ijT^ is obvious (nofe fhaf fhe maximum is 
faken elemenf-wise). 

By seffing B = max {Umax {BdT + 1); Vax {BdT +1)}, 
we prove fhe resulf. □ 


We now proof Proposifion 


Proof. (Proposition We note that D (y, z) sums ^ 
Uk.tlvk) and Wki,t {Vk^yuZki) over t, k, I, as defined in 0. 

The numerafor in fhe RHS of ( [TT| | can be expanded 
info a sum confaining ferms of eifner 

djlLk f , , \ \ 

’ ((ymax + a*A)fct) 


dy 


or 


((ymax + ^ix)kt ) (ymax + ^ix)it , (Zmax + ^ix)kit) 


dWkl.t 

dr 

where T sfands for eifher yk{t), yi{t), or Zki{t), with 
either aixk{t) or bixki{t) as weights. Because Assumption 
0is satisfied, according fo | [69) in Lemma 10 we have 

(ymax T ^ix)kt — yk{f) T B 
(Zmax T ^ix)klt — Zkl{t) T B 

for all k, I, t. From Lemma 0 we have 

{yk{t) +B)> ((ymax + a,x)kt) 


and 

dwki^ (yfc(f) + B, yi{t) + B, Zki{t) + B) 


> 


dT 
dwki.t 


dT 

Therefore, if 


((ymax + ^ix)kt ’ (ymax + ^ix)it , (Zmax + ^ix)klt) 


f > 


Vy,zB ((y, z) + Be) ■ (a^A, b.x) 
Vy,z5(y,z) • (a,A,b,A) 


(73) 


fhen (TTj l is always satisfied. Similarly fo fhe above, fhe 
numerafor in fhe RHS of | [73) can be expanded info 
a sum confaining ferms of eifher {yk{t) + B) and 

{yk{t) + B, yi{t) + B, Zki{t) + B) wifh eifher aixk{t) 
or bixki{t) as weighfs. 

Again, fhe denominafor in fhe RHS of 0 (or equiv¬ 
alently, | [73) ) can be expanded into a sum containing 
terms of eifher ^^{y{t)) or ^^^^{yk{t),yi{t), Zki{t)), 
wifh eifher a^xk{t) or hxkft) as weighfs. 

For any given i,\, fhe ferms ^^^^{yk{t) + B) 
and ‘^^{yk{t)) have fhe same weighf a^xk{t), 
and ^^^'^{yk{t) + B,yi{t) + B,Zki{t) + B) and 
^^‘^{yk{t),yi{t),Zki{t)) have fhe same weighf b,xki{t)- 
According fo Lemmas 0 and 0 for any i5 > 0, fhere exisf 
sufficienfly large values of y and z, such fhat 


RHS of 1731 < l + (5 


Following a similar reasoning, we know fhaf, for any 
(5 > 0, fhere exisf sufficienfly large values of y and z, 
such fhaf 

RHS of @ < fl + 5 

We assume sufficiently large y, z in the following, in 
which case we can sef f = 1 + S and tjj = Pt + 6 while 
satisfying < [73) (fhus (TT) ) and | [T^ . 

We fhen nofe fhaf from (T^ , < |16) , and fhe definifion of 
n, we have 

D{(jnPyi*) < {(jnl})^b{x*) 

= {{l + 5){^ + 5))^b{rT) 

= {n^ + 5')b{^*) 


where 5' = 511 -1- 5 -I- <5^ > 0 is an arbifrary consfanf 
(because 6 is an arbitrary constant). The first inequality 
is because of > 1 and D{(j)'tjj'x.*) is a pol 5 momial of 
(fn/jx* wifh maximum order of fl, where we nofe fhaf y 
and z are bofh linear in x. 

We fhen have 

ThI<®^=SI='+i' (74) 

D{x*) D{x*) 

Unfil now, we have shown fhaf 0 holds for suffi¬ 
ciently large y and z. According to Assumption 0 the 
number of insfances fhaf unpredicfably leave fhe sysfem 
in each slof is upper bounded by a consfant Bd. If follows 
fhaf y and z increases wifh M when M is larger fhan 
a cerfain fhreshold. Therefore, fhere exisfs a sufficienfly 
large M, so fhaf we have a sufficiently large y and z that 
satisfies <[74). 
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Hence, the competitive ratio upper bound can be 
expressed as 

r = max r(I(M)) < + 5' (75) 

I(M) 

for sufficiently large M. 

According to the definition of fhe big-O nofafion, we 
can also wrife 

r = 0(1) (76) 

because ff and 6' are bofh consfanfs in M. □ 


Appendix E 

Proof of Proposition]^ 


Define Tmax > 1 as an arbitrarily large timeslot index. 
We note that there are full look-ahead windows of 
size T wifhin fimeslofs from 1 to Tmax/ where [xj denotes 
the integral part of x. In the last window, there are Tmax — 
T • slots. We have 


F Tmax — T ■ 




T 


I ^max I 

< y ^ ^ ^ FiT) (77) 


because F{T) is convex non-decreasing and F{0) = 0. 

For the true optimal configuration tt*, according to 
the definitions of e(T) and F(T), fhe difference in fhe 
predicfed and acfual sum-cosfs satisfies 


t=i t=i 

Trr, 


< 


T 


F{T) + F T„ax - T 


Trr, 


T 


< F{T) 


T 


(78) 


where fhe lasf inequalify follows from | [77) . Similarly, for 
fhe configuration tt^ obfained from predicfed cosfs, we 
have 

^max Tmax rp 

E - E <^nT) (79) 

t=i t=i 


In the following, we esfablish fhe relationship befween 
TT* and TTp. Assume fhaf, in we neglecf fhe migrafion 
cosf af fhe begirming of eadi look-ahead window, i.e. 
we consider each window independenfly and fhere is 
no migrafion cosf in fhe firsf fimeslof of each window, 
fhen we have 


E 




t=i 


where fhe consfanf F > 1 is fhe compefifive ratio of 
solving 1^. This holds because fhere is no connecfion 
befween differenf windows, fhus fhe opfimal sequences 
(considering predicfed cosfs) obfained from ||^ consfifufe 
fhe opfimal sequence up fo a facfor F for all timeslofs 
[l,Tmax]- Now we relax fhe assumption and consider 
fhe exisfence of migrafion cosf in fhe firsf slof of each 
window. Nofe fhaf we carmof have more fhan + 1 
windows and fhe firsf fimeslof t = 1 does nof have 


migrafion cosf. Thus, 

Tmax Tmax rp 

t=i t=i 

The bound holds because regardless of fhe configurafion 

in slof fo ~ 1/ the migration cost in slot to cannot exceed 


By multiplying F on both sides of (781 and summing 
up the result with 1801, we get 


max max rp 

Y^D^^(t)-TY,A^^(t)<^07F{T)+a) (81) 


Summing up < [7^ with | [8T| |, dividing both sides by T^ax/ 
and taking the limit on both sides yields the proposition. 


Appendix F 

Proof of Proposition]^and Corollary]^ 

Taking the derivative of T>(T), we get 


dT 


iT + l)T 


(PG(T) 

dT^ 


> 0 


(82) 


where the last inequality is because G(T) is convex. This 
implies that $(T) is non-decreasing with T. Hence, there 
is at most one consecutive interval of T (the interval may 
only contain one value) such that < [23) is satisfied. We 
denote this interval by [T-jT+J, ancTa specific solution 
to < |23| is Tq € 

We note that and $(T) have the same sign, 
because ^ 0 yields $(r) ^ 0 and vice versa, which 
can be seen from ( |2^ and p3) . When T < T_, we have 
$(T) < 0 and hence < Q; when T > T+, we have 
$(T) > 0 and hence > 0. This implies that In 9, thus 
9{T), keeps decreasing with T until the optimal solution 
is reached, and afterwards it keeps increasing with T. It 
follows that the minimum value of 9(T) is attained at 
T G [r_,T+]. Because Tq G [T_,r+] and T* is a discrete 
variable, we complete the proof of the proposition. 

Noting that we do not consider the convexity of 9{T) 
in the above analysis, we can also conclude the corollary. 


Appendix G 

Additional Simulation Results 

We study the computational overhead of the pro¬ 
posed algorithm for obtaining service configurations 
corresponding to the results shown in Fig. |^a). The 
simulations were run in MATLAB on a laptop with 
Intel(R) Core(TM) i5-4300U CPU, 4 GB memory, and 64- 
bit Windows 10. We focus on the computation time and 
amount of floating-point operations (FLOP) for every 
computation of the next T-slot configuration for one 
instance. We compare the performance between the pro¬ 
posed approach (E) and the precise future knowledge 
scenario (D). We do not compare against MMC-only and 
backend cloud-only approaches (A, B, and C) because 
those approaches are much simpler but produce a sig¬ 
nificantly higher cost than approaches D and E (see Eig. 
I^a)) thus it is not fair to compare their computational 
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Table 2 

Statistics of computation time and FLOP count 


Performance measure 

Approach 

Sum 

Mean 

Sfd. dev. 

Maximum 

Compufafion fime (seconds) 

Prec. fuf. knowledge (D) 

4.46 X 10® 

4.39 

4.78 

50.9 

Proposed (E) 

6.45 X 10® 

1.84 

0.72 

3.22 

PLOP counf 

Prec. fuf. knowledge (D) 

1.71 X 10®° 

1.68 X 10®' 

1.85 X 10®' 

2.03 X 10® 


Proposed (E) 

2.14 X 10®° 

6.08 X 10° 

2.43 X 10° 

7.56 X 10° 




Figure 7. Cumulative distribution functions (CDF) of com¬ 
putation time and FLOP count. 


overheads. The FLOP counts are obtained using the 
toolbox in [Al] (references for appendices are lisfed af 
fhe end of fhis documenf). 

We collecfed sfafisfics of fhe compufafion fime and 
FLOP counf for each execution of fhe algorifhm fo 
compufe fhe nexf T-slof configuration when using ap¬ 
proaches D and E. Fig. shows fhe cumulafive dis- 
fribufion funcfion (CDF), where CDF(x) is defined as 
fhe percenfage of observafions (i.e., compufafion fime or 
FLOP counf) fhaf has a value smaller fhan or equal fo x. 
Table l^lisfs fhe sum (for all executions), mean, sfandard 
deviation, and maximum values. 


We see fhaf in ferms of compufafion fime, fhe pro¬ 
posed approach (E) can find fhe configuration within 
1.84 s on average. Recalling that each timeslot corre¬ 
sponds to a physical time of 60 s and fhe configurafion 
for mulfiple fimeslofs is usually found by a single exe¬ 
cution of fhe algorifhm, fhe fime needed for rurming fhe 
algorithm is relatively short. 

We note that because we ran the simulation in MAT- 
LAB and our code is not optimized for performance, 
our resulfs are pessimisfic and fhe algorifhm should run 
much fasfer with a better optimized code written in C, 
for insfance. This can be seen by fhe facf fhaf fhe average 
ELOP counf of fhe proposed approach is only 6.08 x 10® 
while modem personal compufers can easily complefe 
over 10® floafing-poinf operations per second (PLOPS), 
see fhe experimental results in |[A2]| for example. Server 
machines will be even more powerful. This means fhaf 
if we fully utilize fhe resource of a personal compufer, 
fhe proposed algorifhm should be able fo find fhe con¬ 
figurafion (for a single insfance) wifhin 0.01 s. Such a 
timescale will be sufficient for deploying newly arrived 
insfances on fhe cloud in real fime. Prom fhis PLOP 
count comparison, we can conclude that the proposed 
algorithm only consumes a small amount of processing 
capabilify so if is applicable in practice and scalable fo 
a reasonably large amount of users. 

We furfher nofe fhaf fhe proposed algorifhm may ben- 
efif from parallelization and performing fhe compufafion 
on GPUs. Por example, parallelization is possible for fhe 
"for all" loops befween Lines and and also befween 
Lines 10 and [15] in Algorifhm |^This can furfher expedife 
fhe process oTfinding configurations. 

Comparing approach D with the proposed approach 
(E), Table shows that the mean, standard deviation, 
and maximum values of fhe proposed approach is lower 
fhan those of approach D. This is because fhe proposed 
approach uses fhe optimal look-ahead window size, 
which is usually much smaller fhan fhe fofal number of 
fimeslofs in which fhe insfance remains active, whereas 
approach D is assumed fo have precise knowledge of 
fhe fufure and fherefore fakes info accounf fhe enfire 
duration in which fhe insfance is active. According fo 
fhe complexify analysis in Section 3.3 considering more 


fimeslofs in fhe opfimizafion causes higher algorifhmic 
complexify, fhus explaining fhe resulf. 

When looking af fhe sum values in Table fhe 
proposed approach has a similar buf slighfly larger 
compufafion fime and PLOP counf fhan approach D. 
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This is because approach D is assumed to know exactly 
when an instance departs from the system, whereas the 
proposed approach does not have this knowledge and 
may consider additional slots after instance departure 
as part of fhe opfimizafion process. However, fhe gap is 
nof large because fhe proposed approach has a relafively 
small look-ahead window; fhe insfance deparfure be¬ 
comes known fo fhe proposed approach af fhe begirming 
of a new look-ahead window. 

The facf fhaf fhe mean and maximum values (in 
Table 1^ of fhe proposed approach are much smaller fhan 
fhose of approach D also makes fhe proposed approach 
more suifable for real-time requesfs, because if can ap¬ 
ply a configuration quickly affer insfance arrival. More 
imporfanfly, we recall fhaf approach D is impractical 
because if assumes precise prediction of fufure cosfs and 
insfance deparfure times. 
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